PREFACE


The present volume contains a complete translation, made
in consequence of a suggestion by my eminent friend, Professor
E. T. Whittaker, F.R.S., of the Italian text of my Lezioni di
calcolo differenziale assoluto.¹ Two new chapters have been added,
which are intended to exhibit the fundamental principles of
Einstein’s General Theory of Relativity (including, of course,
as a limiting case, the so-called Special or Restricted Theory)
as an application of the Absolute Calculus.

I have already had occasion to remark in the Preface to the 
Italian edition that we possess various systematic and well- 
written expositions of Relativity by celebrated authors. The 
short treatment which is offered in the two new chapters of the 
present work presents some distinctive features which it may 
be well to point out explicitly. 

In the first place, in order not to increase the size of the book 
unduly, I have thought it expedient to confine myself to tracing 
the relativistic evolution of Mechanics (properly so called) and 
of Geometrical Optics, and to developing its most important 
consequences. In this treatment the whole of Electromagnetism 
is sacrificed. The sacrifice is certainly regrettable, since Electro- 
magnetism was historically related in the most intimate way to 
Einstein’s conception, having served indeed as the support and 
model for Restricted Relativity. Furthermore, Electromagnetism, 
in common with every other physical phenomenon, now comes 
within the ambit of General Relativity. Much as the omission 
of Electromagnetism is to be regretted, it has the advantage 
of reducing the programme to subjects belonging to the pure 
Newtonian tradition (or to its developments); and it allows us 
to take a clearer and more exact view of the transition from the 
classical scheme of Mechanics to the relativistic one. 

¹ Compiled by Dr. Enrico Persico (Rome, Stock, 1925).


For this reason I have followed the method — which I have 
adopted sometimes in lectures or articles on special subjects — 
of taking the classical laws as the starting point and then of 
trying to find inductively what modifications — negligible in 
ordinary circumstances — should be introduced in order to take 
account of Einstein’s ideas; and in the first place, naturally, 
to take account of his Principle of Relativity, that is to say, the 
invariant behaviour of these laws under all transformations of
space and time, an auxiliary four-dimensional $ds^2$ being duly
employed. This method has seemed to me to be preferable to 
the procedure of enunciating the postulates of relativistic 
Mechanics in abstract tensorial form, which is so comprehensive 
in physical content as to be almost inaccessible to ordinary 
intuition, except with ample comment and illustration. 

A further characteristic of our exposition is that we make 
extensive use not only of geometrical representation but also 
of the differential properties pertaining to the space-time con- 
tinuum; attention is drawn also to the special importance of 
the Einsteinian statics, the treatment being rigorous in some 
cases, while in others which involve fields variable with the time 
it is approximate. 

In closing this introduction to Chapters XI and XII I would 
add that they were prepared, still in collaboration with Professor 
Persico, at the suggestion of Mr. F. F. P. Bisacre, M.A. 

In connexion with the whole of the English edition, I must
warmly thank the translator, Miss Marjorie Long, formerly
Scholar of Girton College, who with double competence, scientific
and linguistic, has known how to combine scrupulous respect
for the text with its effective adaptation to the spirit of the
English language.

I owe hearty thanks also to Dr. John Dougall, who, while 
revising the proofs, has checked the analysis throughout, detected 
some oversights, and made many useful suggestions for improve- 
ment. I wish finally to thank my English publishers, who have 
not only acceded to, but almost always anticipated, my wishes 
in regard to symbols and the typography of the book. 


T. LEVI-CIVITA. 
Riemann’s general metric and a formula of Christoffel constitute
the premises of the absolute differential calculus. Its
development as a systematic branch of mathematics was a later 
process, the credit for which is due to Ricci, who during the ten 
years 1887-1896 elaborated the theory and worked out the 
elegant and comprehensive notation which enables it to be easily 
adapted to a wide variety of questions of analysis, geometry, 
and physics. 

Ricci himself, in an article published in Volume XVI of the
Bulletin des Sciences Mathématiques (1892), gave a first account
of his methods, and applied them to some problems in differential
geometry and mathematical physics. Later on other interesting
applications, made by himself or his students (to which group
I had the privilege of belonging), suggested the desirability of
preparing a general account of the whole subject, including
methods, results, and a bibliography. This was the origin of the
memoir “Méthodes de calcul différentiel absolu et leurs applications”,
which was compiled by Professor Ricci and myself in
collaboration, on the courteous invitation of Klein, and appeared
in Volume 54 of Math. Ann. (1901).

There is a chapter on the foundations of the absolute calculus,
with special reference to the transformation of the equations of
dynamics, in Wright’s Tract, Invariants of Quadratic Differential
Forms (Cambridge University Press, 1908); apart from this,
while special researches based on the use of this method were
continued after 1901 by a limited number of mathematicians,
yet general attention was not again directed to it until the great
renaissance of natural philosophy, due to Einstein, which found
in the absolute differential calculus the necessary instrument
for formulating the new ideas mathematically and for the subsequent
numerical work.

Einstein’s discovery of the gravitational equations was announced
by him in the famous note “Zur allgemeinen Relativitätstheorie”
in the following words: “Sie bedeutet einen wahren Triumph der
durch Gauss, Riemann, Christoffel, Ricci . . . begründeten Methoden
des allgemeinen Differentialkalküls.”

In an earlier memoir Einstein had given a new exposition of
those elements and formulae of the absolute calculus which more
specifically served his purposes. A similar standpoint was subsequently
adopted by the most distinguished workers in the field
of general relativity, in particular by Weyl, Laue, Eddington,
and Birkhoff, all of whom made conspicuous original contributions,
both of idea and of method, to the physical theories, in
addition to useful and elegant developments of the tensor calculus.
Similar statements can be made for Carmichael, Marcolongo,
Kopff, and Becquerel (to mention, from the vast literature on the
subject, only the books I have myself had occasion to consult),
while de Donder has avoided the notation of the absolute
calculus and used instead the theory of integral invariants.

In recent years there have been some general treatises devoted
to the absolute calculus; for instance, those of Juvet, Marais,
and Galbrun. Lastly, there is another calculus, in a new order
of ideas, not less comprehensive and perhaps even more general,
invented by Schouten, and developed with the collaboration
of Struik.

In face of this plentiful and valuable literature a new discussion
of Ricci’s methods might seem to be superfluous; and
conceptually this is perhaps true.

In fact, of the improvements and additions to the scheme
of 1901 (the memoir in Math. Ann.), derived mainly from the
notion of parallelism and on this basis introduced by me into
two courses of lectures given at the University of Rome during
the sessions 1920-1921 and 1922-1923, all, or almost all, will
be found as independent discoveries of the authors already cited,
in one or other of their books.

For instance, the definition of a tensor, and some algebraic
anticipations of the results intended to simplify the proofs, are
to be found in Weyl, Laue, and Marais, all of whom, like Eddington,
establish a more or less intimate connexion between covariant
differentiation and parallelism. A thorough discussion
of the latter is also given by Juvet and Galbrun. But the association
with the algebraico-tensorial notation and with the elements
of differential geometry is always less detailed and systematic
than what I tried to establish in my lectures. The line of argument
followed in them has a particular unity, which may perhaps
justify their appearance in print at this juncture.

The manuscript was edited with great care and intelligence 
by Dr. Enrico Persico, from notes of the lectures. I wish to express
my thanks to him for his valuable help, and to my publisher, 
Signor Stock (who also attended the lectures), to whose continued 
encouragement the existence of the book is due. 

TULLIO LEVI-CIVITA. 

Rome, December, 1923.
NOTE TO

SECOND ENGLISH IMPRESSION 

Advantage has been taken of a reprint to correct a few 
errors and to add references to some recent 
work (see p. 441). 

T. L.-C. 

Rome, November, 1928.


PUBLISHER’S NOTE 

Professor Tullio Levi-Civita died in Rome on the 29th of
December, 1941.

An appreciation of his work was published in the Atti della
Accademia Nazionale dei Lincei, Serie Ottava, Vol. I, Fascicolo
II, November, 1946, with a list of his 204 scientific publications.
This volume includes the Author’s last revisions of the
English Version.


April, 1947. 
THE ABSOLUTE
DIFFERENTIAL CALCULUS 


PART I 

Introductory Theories 

CHAPTER I

Functional Determinants and Matrices 

1. Geometrical terminology. 

In analytical geometry it frequently happens that compli- 
cated algebraic relationships represent simple geometrical pro- 
perties. In some of these cases, while the algebraic relationships 
are not easily expressed in words, the use of geometrical language, 
on the contrary, makes it possible to express the equivalent 
geometrical relationships clearly, concisely, and intuitively. 
Further, geometrical relationships are often easier to discover 
than are the corresponding analytical properties, so that geo- 
metrical terminology offers not only an illuminating means of 
exposition, but also a powerful instrument of research. We 
can therefore anticipate that in various questions of analysis it 
will be advantageous to adopt terms taken over from geometry. 

For this purpose it is essential to adopt the fundamental
convention of using the term point of an abstract n-dimensional
manifold (n being any positive integer whatever) to denote a
set of n values assigned to any n variables $x_1, x_2, \dots x_n$. This
is an obvious extension of the use of the term in the one-to-one
correspondence which can be established between pairs or triplets
of co-ordinates and the points of a plane or space, for the
cases n = 2 and n = 3 respectively. For the case of n variables
we can thus also speak of a field of points (rather than of
values assigned to the x’s), and of the region round a specified
point $x_i$ $(i = 1, 2, \dots n)$.

If the x’s are n functions $x_i(t)$ of a real variable t, then when
t varies continuously between $t_0$ and $t_1$ we get a simply infinite
succession of points, the aggregate of which (as for n = 2 and
n = 3) is called a line, and more precisely an arc or segment of
a line.


2. Functional determinants and change of variables.

Let there be n functions of n variables:

$$u_1(x_1, x_2, \dots x_n), \quad u_2(x_1, x_2, \dots x_n), \quad \dots \quad u_n(x_1, x_2, \dots x_n),$$

the functions and their derivatives to any required degree being
supposed finite and continuous in the field considered.

To simplify the notation, let x (without a suffix) represent
not only (as is usual) any one of the n variables $x_1, x_2, \dots x_n$,
but also (as is sometimes done) the whole set of them; and
similarly for other letters which will be used farther on. With
this convention the given functions can be written in the abridged
form $u_i(x)$ $(i = 1, 2, \dots n)$.


With the usual notation, the functional determinant or Jacobian
of the u’s is the determinant of the nth order whose terms are
the first derivatives of the u’s, i.e.

$$D = \begin{vmatrix}
\dfrac{\partial u_1}{\partial x_1} & \dfrac{\partial u_1}{\partial x_2} & \dots & \dfrac{\partial u_1}{\partial x_n} \\
\dfrac{\partial u_2}{\partial x_1} & \dfrac{\partial u_2}{\partial x_2} & \dots & \dfrac{\partial u_2}{\partial x_n} \\
\vdots & \vdots & & \vdots \\
\dfrac{\partial u_n}{\partial x_1} & \dfrac{\partial u_n}{\partial x_2} & \dots & \dfrac{\partial u_n}{\partial x_n}
\end{vmatrix}.$$

Such a determinant is sometimes represented by the abridged
notation

$$\begin{pmatrix} u_1 & u_2 & \dots & u_n \\ x_1 & x_2 & \dots & x_n \end{pmatrix},$$

analogous to that used for fractions and substitutions, the set
of functions u representing the numerator and the set of variables
x the denominator of a fraction. The analogy of form
is justified by the analogy of properties, as can be seen by considering
the effect on a functional determinant of a change of
variables. For let the x’s be functions of n variables y,

$$x_1 = x_1(y_1, \dots y_n), \quad \dots \quad x_n = x_n(y_1, \dots y_n), \tag{1}$$

and suppose further that these equations represent a reversible
transformation, i.e. that they also define the y’s as functions
of the x’s, or, in other words, that they are soluble with respect
to the y’s. If then the u’s are considered as functions of the
y’s (being given in terms of the x’s, which are functions of the
y’s), and the corresponding functional determinant

$$\begin{pmatrix} u_1 & \dots & u_n \\ y_1 & \dots & y_n \end{pmatrix}$$

is formed, it will be found, as will be shown below in § 4,
that it is equal to D multiplied by the determinant of the functions
defined by equations (1), i.e. by

$$\Delta = \begin{pmatrix} x_1 & \dots & x_n \\ y_1 & \dots & y_n \end{pmatrix}.$$


3. The fundamental theorem on implicit functions. 

Before proving the theorem just referred to, we must recall 
a fundamental theorem relating to implicit functions. It is known 
that a relation between two variables of the type 

$$f(y, x) = 0$$

defines y as a function of x, provided certain suitable qualitative
conditions are satisfied.¹ A classical form of the conditions
sufficient for solubility is as follows. Let $x^0$, $y^0$ be a point at which
f vanishes, f being finite in a (plane) region I round the point.
Let $\partial f / \partial y$ exist in I and be not zero for $x = x^0$, $y = y^0$. Then
in a certain (linear) region round the value $x^0$ the given equation
defines a continuous function y(x) such that $f(y(x), x)$ vanishes
identically.

¹ When an equation is said to be “soluble”, this will not necessarily mean that
the process of finding an algebraic solution can be carried out.

For implicit functions of several variables the following 
theorem, which is a generalization of the one just stated, holds. 

Let there be given n equations between n variables y and
any number of variables x of the form

$$f_i(y \mid x) = 0 \qquad (i = 1, 2, \dots n).$$

Let there be a set of values $x^0$, $y^0$ which satisfy these equations;
in a region round the point $x^0$, $y^0$, let the f’s and their
derivatives with respect to the y’s be continuous, and let the
determinant

$$\begin{pmatrix} f_1 & \dots & f_n \\ y_1 & \dots & y_n \end{pmatrix}$$

be not zero. Then the given equations define the y’s as functions
of the x’s in a region round the set of values $x^0$.

It will be seen that from a certain point of view the functional
determinant of several functions of the same number of
variables constitutes a natural generalization of the derivative
of a function of one variable. This will follow explicitly from the
applications of the following section.
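As a brief illustration of the one-variable statement above, the following sketch works through a single case. The function chosen (the unit circle) is an assumption made here for illustration and does not appear in the text.

```latex
% Illustrative example (not from the text): the unit circle.
\[
  f(y, x) = x^2 + y^2 - 1, \qquad \frac{\partial f}{\partial y} = 2y .
\]
% At the point x^0 = 0, y^0 = 1 we have f = 0 and \partial f/\partial y = 2 \neq 0,
% so the equation defines a continuous function in a region round x^0 = 0:
\[
  y(x) = \sqrt{1 - x^2}, \qquad
  f\bigl(y(x), x\bigr) = x^2 + (1 - x^2) - 1 = 0 .
\]
% At the point x^0 = 1, y^0 = 0 the derivative \partial f/\partial y vanishes,
% so the sufficient condition stated above is not satisfied there.
```

The second point shows only that the sufficient condition fails; the theorem as stated makes no assertion in that case.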



4. Effect on a functional determinant of a change of variables. 


Consider first the (sufficient) condition of solubility of the
set of equations (1). Write the equations in the form

$$x_i - x_i(y_1, \dots y_n) = 0 \qquad (i = 1, 2, \dots n),$$

and suppose that there exists at least one set of values of the
y’s and the x’s which satisfy them and for which the functions
$x_i(y)$ and their derivatives are continuous. Then, to apply the
preceding theorem, we must calculate the partial derivatives of
the left-hand side of each equation with respect to the y’s, and
form their determinant. But these derivatives are the terms
$-\partial x_i / \partial y_j$ $(i, j = 1, 2, \dots n)$, and hence the condition of solubility with
respect to the y’s is

$$\Delta \neq 0.$$




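Now take the theorem stated in § 2, and suppose $\Delta \neq 0$.
Multiply together the two determinants D and Δ, i.e., interchanging
rows and columns in Δ, form the product

$$\begin{vmatrix}
\dfrac{\partial u_1}{\partial x_1} & \dfrac{\partial u_1}{\partial x_2} & \dots & \dfrac{\partial u_1}{\partial x_n} \\
\dfrac{\partial u_2}{\partial x_1} & \dfrac{\partial u_2}{\partial x_2} & \dots & \dfrac{\partial u_2}{\partial x_n} \\
\vdots & \vdots & & \vdots \\
\dfrac{\partial u_n}{\partial x_1} & \dfrac{\partial u_n}{\partial x_2} & \dots & \dfrac{\partial u_n}{\partial x_n}
\end{vmatrix}
\times
\begin{vmatrix}
\dfrac{\partial x_1}{\partial y_1} & \dfrac{\partial x_2}{\partial y_1} & \dots & \dfrac{\partial x_n}{\partial y_1} \\
\dfrac{\partial x_1}{\partial y_2} & \dfrac{\partial x_2}{\partial y_2} & \dots & \dfrac{\partial x_n}{\partial y_2} \\
\vdots & \vdots & & \vdots \\
\dfrac{\partial x_1}{\partial y_n} & \dfrac{\partial x_2}{\partial y_n} & \dots & \dfrac{\partial x_n}{\partial y_n}
\end{vmatrix}.$$

Applying the ordinary rule, the product by rows gives as
the typical element $a_{rs}$ of the resulting determinant the expression

$$a_{rs} = \sum_{i=1}^{n} \frac{\partial u_r}{\partial x_i} \frac{\partial x_i}{\partial y_s} = \frac{\partial u_r}{\partial y_s}$$

(remembering the rule for differentiating a function of one or
more functions). Hence, as already stated, the product is the
determinant of the u’s with respect to the y’s. This result is
expressed by the formula

$$\begin{pmatrix} u_1 & \dots & u_n \\ y_1 & \dots & y_n \end{pmatrix}
= \begin{pmatrix} u_1 & \dots & u_n \\ x_1 & \dots & x_n \end{pmatrix}
\begin{pmatrix} x_1 & \dots & x_n \\ y_1 & \dots & y_n \end{pmatrix}, \tag{2}$$

which justifies the use of this notation for the functional determinants.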
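The product rule for functional determinants can be checked by hand on a small case. The functions below are assumptions chosen here for illustration (n = 2); they are not taken from the text.

```latex
% Illustrative check of the product rule for n = 2 (functions chosen for this example).
\[
  u_1 = x_1 x_2, \quad u_2 = x_1 + x_2; \qquad
  x_1 = y_1 + y_2, \quad x_2 = y_1 - y_2 .
\]
% Determinant of the u's with respect to the x's, and of the x's with respect to the y's:
\[
  D = \begin{vmatrix} x_2 & x_1 \\ 1 & 1 \end{vmatrix} = x_2 - x_1 = -2 y_2,
  \qquad
  \Delta = \begin{vmatrix} 1 & 1 \\ 1 & -1 \end{vmatrix} = -2 .
\]
% Expressing the u's directly in terms of the y's:
\[
  u_1 = y_1^2 - y_2^2, \quad u_2 = 2 y_1, \qquad
  \begin{pmatrix} u_1 & u_2 \\ y_1 & y_2 \end{pmatrix}
  = \begin{vmatrix} 2 y_1 & -2 y_2 \\ 2 & 0 \end{vmatrix} = 4 y_2 = D \, \Delta .
\]
```

Since Δ = −2 is not zero, the transformation is reversible, as the hypothesis requires.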

5. The necessary and sufficient conditions for the independence 
of n functions of n variables. 

If therefore the functional determinant of n functions of n
variables does not vanish identically, it follows that this property
still holds when the original variables are replaced by
others related to the first set by the transformation (1) (with
the condition $\Delta \neq 0$); in other words, this is an invariant property.
The following definition may therefore be given:

Definition. n functions of n variables are said to be independent
when their functional determinant does not vanish identically.

The reason for applying the word “independence” to this
property is shown by the following theorem.
Theorem. Given n functions u of n variables x, the necessary
and sufficient condition for the non-existence of any (differentiable)
relation between them of the type

$$f(u_1, u_2, \dots u_n) = 0 \tag{3}$$

involving only the u’s and not the x’s, is that their functional determinant
does not vanish identically.

We shall first show that the condition is sufficient; then that
it is also necessary, but for the moment confining the proof to
a particular case; the theorem in its general form will be shown
farther on to be itself only a particular case of another still
more general theorem (cf. § 7).

Suppose the condition satisfied:

$$D = \begin{pmatrix} u_1 & \dots & u_n \\ x_1 & \dots & x_n \end{pmatrix} \neq 0. \tag{4}$$

We shall then show that no relation of the type (3) can exist.
(Identities are of course not considered; i.e. we exclude the case
where equation (3) is satisfied when arbitrary values are assigned
to the u’s, as it would not then represent any relation between
the u’s.) Suppose that such a relation does exist. Differentiating
with respect to $x_1, x_2, \dots x_n$, we should get n equations

$$\sum_{r=1}^{n} \frac{\partial f}{\partial u_r} \frac{\partial u_r}{\partial x_i} = 0 \qquad (i = 1, 2, \dots n),$$

linear and homogeneous in the derivatives $\partial f/\partial u_1, \dots \partial f/\partial u_n$. Now since by
hypothesis f is a true function, not zero identically, these derivatives
are not all zero. Hence the determinant of the coefficients
of this group of equations vanishes; i.e. D = 0, which is contrary
to our hypothesis. The condition (4) is therefore sufficient
to secure the non-existence of any relation of the type (3).

To prove that condition (4) is necessary, we shall show that
if it is not satisfied, i.e. if

$$D = \begin{pmatrix} u_1 & \dots & u_n \\ x_1 & \dots & x_n \end{pmatrix} = 0, \tag{5}$$

then the u’s are connected by a relation (at least one) of the
type (3). For the moment the only case considered will be that
in which at least one of the minors of order $n - 1$ of the determinant
D does not vanish. This minor will in general be of the
type

$$D' = \begin{pmatrix} u_{q_1} & u_{q_2} & \dots & u_{q_{n-1}} \\ x_{p_1} & x_{p_2} & \dots & x_{p_{n-1}} \end{pmatrix},$$

where $p_1, \dots p_{n-1}$ and $q_1, \dots q_{n-1}$ represent any two arrangements
of $n - 1$ integers chosen without repetitions from the numbers
1, 2, ... n. But since the order in which the x’s and u’s are
made to correspond to the numbers 1, 2, ... n is immaterial, we
can, without loss of generality, suppose numbers assigned to the
variables in such a way that $D'$ is the minor formed by the first
$n - 1$ rows and $n - 1$ columns; we thus get

$$D' = \begin{pmatrix} u_1 & \dots & u_{n-1} \\ x_1 & \dots & x_{n-1} \end{pmatrix} \neq 0. \tag{6}$$

This condition expresses the fact that no relation exists
between the first $n - 1$ functions.

Now we know that if a reversible transformation is applied
to the x’s, it follows from hypothesis (5) that the determinant
of the u’s with respect to the new set of variables y is also zero.
Let the relation between the x’s and y’s be given by the following
equations:

$$y_1 = u_1(x_1, \dots x_n), \quad \dots \quad y_{n-1} = u_{n-1}(x_1, \dots x_n), \quad y_n = x_n. \tag{7}$$

We may note that these formulae define a reversible transformation,
since the functional determinant of the y’s with respect
to the x’s is

$$\begin{vmatrix}
\dfrac{\partial u_1}{\partial x_1} & \dots & \dfrac{\partial u_1}{\partial x_{n-1}} & \dfrac{\partial u_1}{\partial x_n} \\
\vdots & & \vdots & \vdots \\
\dfrac{\partial u_{n-1}}{\partial x_1} & \dots & \dfrac{\partial u_{n-1}}{\partial x_{n-1}} & \dfrac{\partial u_{n-1}}{\partial x_n} \\
0 & \dots & 0 & 1
\end{vmatrix},$$
and expanding this from the last row, it is seen to be equal to
$D'$, which by hypothesis is not zero.

Now consider the u’s as functions of the y’s; using equations
(7) we get

$$u_1 = y_1, \quad \dots \quad u_{n-1} = y_{n-1}, \quad u_n = u_n(y_1, \dots y_{n-1}, y_n). \tag{8}$$

Expressing the fact that the determinant of the u’s with
respect to the y’s is zero, we get

$$\begin{vmatrix}
1 & 0 & \dots & 0 & 0 \\
0 & 1 & \dots & 0 & 0 \\
\vdots & \vdots & & \vdots & \vdots \\
0 & 0 & \dots & 1 & 0 \\
\dfrac{\partial u_n}{\partial y_1} & \dfrac{\partial u_n}{\partial y_2} & \dots & \dfrac{\partial u_n}{\partial y_{n-1}} & \dfrac{\partial u_n}{\partial y_n}
\end{vmatrix}
= \frac{\partial u_n}{\partial y_n} = 0.$$

It follows that the last of the equations (8) does not contain
$y_n$; substituting in it from the remaining equations, it becomes

$$u_n = u_n(u_1, \dots u_{n-1}),$$

i.e. a relation between the u’s which does not contain any of the
x’s.

Hence from the hypotheses (6) and (5) it follows that there
exists one relation of the type (3), which is such that $u_n$ can be
expressed in terms of the other u’s. This relation is unique,
because if there were another, then eliminating $u_n$ between them
we should get a relation between $u_1, \dots u_{n-1}$, but this, as already
pointed out, is incompatible with hypothesis (6).
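The steps of the necessity argument can be followed on a small case with n = 2. The two functions below are assumptions chosen here for illustration; they do not appear in the text.

```latex
% Illustrative example for n = 2 (functions chosen for this example).
\[
  u_1 = x_1 + x_2, \qquad u_2 = (x_1 + x_2)^2 .
\]
% Hypothesis (5): the functional determinant vanishes identically.
\[
  D = \begin{vmatrix} 1 & 1 \\ 2(x_1 + x_2) & 2(x_1 + x_2) \end{vmatrix} = 0 .
\]
% Hypothesis (6): a minor of order n - 1 = 1 does not vanish.
\[
  D' = \frac{\partial u_1}{\partial x_1} = 1 \neq 0 .
\]
% Transformation (7) and its determinant, which equals D':
\[
  y_1 = x_1 + x_2, \quad y_2 = x_2, \qquad
  \begin{vmatrix} 1 & 1 \\ 0 & 1 \end{vmatrix} = 1 .
\]
% Equations (8): the last function no longer contains y_2.
\[
  u_1 = y_1, \quad u_2 = y_1^2, \qquad
  \frac{\partial u_2}{\partial y_2} = 0, \qquad u_2 = u_1^2 .
\]
```

The final equation is the relation of type (3) between the u’s, free of the x’s, whose existence the proof establishes.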


6. Functional matrices. Definition of the independence of
m functions of n variables.

We shall now examine the more general case in which the
number m of the functions u is not equal to the number n of the
variables x. For this purpose we must consider the functional
matrix of the given functions, i.e. the following matrix of m
rows and n columns:

$$\begin{Vmatrix}
\dfrac{\partial u_1}{\partial x_1} & \dfrac{\partial u_1}{\partial x_2} & \dots & \dfrac{\partial u_1}{\partial x_n} \\
\vdots & \vdots & & \vdots \\
\dfrac{\partial u_m}{\partial x_1} & \dfrac{\partial u_m}{\partial x_2} & \dots & \dfrac{\partial u_m}{\partial x_n}
\end{Vmatrix}.$$


In what follows it will be denoted by M; but it must be noted 
that no numerical value is attached to the symbol, and there- 
fore that M does not represent a quantity, but is an abbreviation 
for the arrangement of terms under consideration. 

The characteristic of a matrix is the order of the non-vanishing
determinants of highest order which can be constructed from it;
it can therefore obviously not be greater than the number of
rows or the number of columns, whichever is the less.

We now give a definition, which will be justified in the following
section.

Definition. m functions of any number of variables are said
to be independent when the characteristic of their functional matrix
is m. It follows immediately that if the number of functions is
greater than the number of variables, the functions cannot be
independent; while if the two numbers are equal, the definition
coincides with that already given, since the matrix becomes a
determinant of order m, and if its characteristic is m this is
equivalent to saying that the determinant does not vanish.
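A short worked case may make the definition concrete. The two functions of three variables below are assumptions chosen here for illustration; the characteristic is what is now usually called the rank of the matrix.

```latex
% Illustrative example with m = 2 functions of n = 3 variables (chosen for this example).
\[
  u_1 = x_1 + x_2 + x_3, \qquad u_2 = x_1 x_2 x_3 ,
\]
% Functional matrix M (2 rows, 3 columns):
\[
  M = \begin{Vmatrix} 1 & 1 & 1 \\ x_2 x_3 & x_1 x_3 & x_1 x_2 \end{Vmatrix} .
\]
% The determinant of order 2 formed from the first two columns is
\[
  \begin{vmatrix} 1 & 1 \\ x_2 x_3 & x_1 x_3 \end{vmatrix} = x_3 (x_1 - x_2),
\]
% which does not vanish identically, so the characteristic is 2 = m
% and the two functions are independent in the sense of the definition.
```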

7. Theorem. 

Given m functions u of any number of variables x, if the
characteristic of their functional matrix is k, then there are $m - k$
relations (and not more) between the u’s which do not involve
the x’s.

It will follow immediately as a corollary that if the functions
are independent (the case k = m) there exists no relation between
them.

The theorem just stated has been proved above (§ 5) for the
particular cases in which the number of functions is equal to the
number of variables and in addition k = m or $k = m - 1$.
We proceed to prove it in general, taking various cases in turn,
as follows:

(1) k = m (and therefore $m \le n$), the case of independence;

(2) k < m:

(2a) k = n,
(2b) k < n.

Case (1): k = m. This hypothesis is equivalent to saying
that there exists a minor of order m which is not zero; remembering
the remark made in § 5 on the numbering of the variables,
we may suppose without loss of generality that

$$\begin{pmatrix} u_1 & \dots & u_m \\ x_1 & \dots & x_m \end{pmatrix} \neq 0.$$

Applying the theorem of § 5 it follows that the u’s are not
connected by any relation which does not involve any of the
x’s.

Case (2a): k < m, k = n. There is therefore a minor of
order n which is not zero. We may arrange the suffixes of the
u’s and the x’s so that the minor in question is that formed by
the first n rows and n columns, and we shall have

$$D = \begin{pmatrix} u_1 & \dots & u_n \\ x_1 & \dots & x_n \end{pmatrix} \neq 0.$$

We shall now show that $u_{n+1}, \dots u_m$ can all be expressed
in terms of the remaining u’s, without using the x’s, so that we
shall have $m - n$ (which is the same as $m - k$) relations between
the u’s. For since $D \neq 0$ we may change the variables. Let
the new variables be given by the equations

$$u_1 = u_1(x_1, \dots x_n), \quad \dots \quad u_n = u_n(x_1, \dots x_n).$$

Solving these equations with respect to the x’s, and substituting
the expressions so obtained in $u_{n+1}, \dots u_m$, these will be expressed
as functions of $u_1, \dots u_n$; hence the theorem is true for this
case.
Case (2b): k < m, k < n. The hypothesis is that there exists
a determinant of order k which is not zero, and that every determinant
of higher order vanishes. Let us arrange the u’s and the
x’s so that

$$D = \begin{pmatrix} u_1 & \dots & u_k \\ x_1 & \dots & x_k \end{pmatrix} \neq 0. \tag{9}$$

We shall show that any function $u_h$ $(h = k + 1, \dots m)$
can be expressed in terms of the first k functions u, without
involving any of the x’s. For this purpose, consider the determinant
Θ formed by bordering D with the (k + 1)th column
and hth row of the matrix; since it is of order k + 1, it is
zero by hypothesis, i.e.

$$\Theta = \begin{pmatrix} u_1 & \dots & u_k & u_h \\ x_1 & \dots & x_k & x_{k+1} \end{pmatrix} = 0. \tag{10}$$

Now applying the theorem of § 5, it follows from this
equation and the inequality (9) that $u_h$ can be expressed as a
function of $u_1, \dots u_k$ which does not involve $x_1, \dots x_{k+1}$, i.e.,
since we are not yet able to say anything about the remaining
x’s,

$$u_h = \varphi(u_1, \dots u_k \mid x_{k+2}, \dots x_n). \tag{11}$$

The next step is to show that $x_{k+2}, \dots x_n$ do not in fact occur
in this expression. If n = k + 1, there is no need to consider
$x_{k+2}, \dots x_n$, and therefore the formula (11) represents the expression
we are in search of, giving $u_h$ in terms of $u_1, \dots u_k$ alone.
If this is not so, let $x_j$ denote any of the variables $x_{k+2}, \dots x_n$,
and consider the determinant $\Theta'$ obtained from Θ by replacing
$x_{k+1}$ by $x_j$, so that

$$\Theta' = \begin{pmatrix} u_1 & \dots & u_k & u_h \\ x_1 & \dots & x_k & x_j \end{pmatrix} = 0,$$

$\Theta'$ vanishing because it is a minor of order k + 1 taken from the
matrix. Expanding it, substituting from equation (11), and
making certain transformations, we can easily show that it
involves the vanishing of $\partial \varphi / \partial x_j$, whence it follows that φ does not
contain $x_j$. In fact, representing compactly by the letter D the
square matrix of those elements of $\Theta'$ which form the determinant
D, we have

$$\Theta' = \begin{vmatrix}
 & & & \dfrac{\partial u_1}{\partial x_j} \\
 & D & & \vdots \\
 & & & \dfrac{\partial u_k}{\partial x_j} \\
\dfrac{\partial u_h}{\partial x_1} & \dots & \dfrac{\partial u_h}{\partial x_k} & \dfrac{\partial u_h}{\partial x_j}
\end{vmatrix}.$$

Using equation (11) the elements of the last row are given by

$$\frac{\partial u_h}{\partial x_l} = \sum_{i=1}^{k} \frac{\partial \varphi}{\partial u_i} \frac{\partial u_i}{\partial x_l} \qquad (l = 1, 2, \dots k);$$

$$\frac{\partial u_h}{\partial x_j} = \frac{\partial \varphi}{\partial x_j} + \sum_{i=1}^{k} \frac{\partial \varphi}{\partial u_i} \frac{\partial u_i}{\partial x_j}.$$

Multiplying the elements of each of the first k rows in turn
by $\partial \varphi / \partial u_1, \dots \partial \varphi / \partial u_k$, and subtracting the sum of these products from
the elements of the last row (which does not change the value
of the determinant) the last row becomes

$$0 \quad \dots \quad 0 \quad \frac{\partial \varphi}{\partial x_j},$$

and therefore, expanding from this row, we get

$$\Theta' = D \, \frac{\partial \varphi}{\partial x_j} = 0.$$

Since by hypothesis $D \neq 0$, it follows that $\partial \varphi / \partial x_j = 0$, which
proves the assertion.

The theorem enunciated at the beginning of this section is
thus completely proved. Applying it to the particular case
m = n, it coincides with the theorem of § 5, which is therefore
now shown to hold without any restriction.
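The count of m − k relations can be seen in a small instance of case (2b). The three functions below are assumptions chosen here for illustration, with the abbreviation s introduced only for this example.

```latex
% Illustrative example of case (2b): m = 3, n = 3, k = 1 (functions chosen for this example).
% Abbreviation used only here: s = x_1 + x_2 + x_3.
\[
  u_1 = s, \qquad u_2 = s^2, \qquad u_3 = s^3, \qquad s = x_1 + x_2 + x_3 .
\]
% Functional matrix: every row is a multiple of (1, 1, 1).
\[
  M = \begin{Vmatrix} 1 & 1 & 1 \\ 2s & 2s & 2s \\ 3s^2 & 3s^2 & 3s^2 \end{Vmatrix} .
\]
% Every determinant of order 2 or 3 taken from M vanishes, while the
% element \partial u_1 / \partial x_1 = 1 does not, so the characteristic is k = 1.
% The theorem then gives m - k = 2 relations free of the x's:
\[
  u_2 = u_1^2, \qquad u_3 = u_1^3 .
\]
```

Here each of the functions $u_2$, $u_3$ is expressed in terms of the first k = 1 function alone, as in the proof.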
CHAPTER II

Systems of Total Differential Equations

1. Preliminary remarks. 

The reader may first be reminded of some general considera- 
tions on differential expressions. 

Given a function $f(x_1, x_2, \dots x_n)$, the expression

$$df = \sum_{i=1}^{n} \frac{\partial f}{\partial x_i} \, dx_i$$

is called the total differential of the function f; it is equal (except
for infinitesimals of higher order) to the increment of f in passing
from the point $x_1, x_2, \dots x_n$ to the infinitely near point
$x_1 + dx_1, x_2 + dx_2, \dots x_n + dx_n$.

Given n functions $X_i$ of the x’s, which, together with their
first derivatives, we shall suppose finite and continuous, the
expression

$$\psi = \sum_{i=1}^{n} X_i(x_1, x_2, \dots x_n) \, dx_i \tag{1}$$

is called a differential, or Pfaffian, expression.

An expression of this form is not always an exact differential;
i.e. there does not always exist a function $f(x_1, x_2, \dots, x_n)$ such
that the given Pfaffian is its total differential. The necessary
and sufficient condition for the existence of such an $f$, i.e. for the
integrability of an equation of the type

$$ df = \sum_{i=1}^{n} X_i\,dx_i, \tag{2} $$

is that the following $\tfrac{1}{2}n(n-1)$ conditions should be satisfied:

$$ \frac{\partial X_i}{\partial x_j} = \frac{\partial X_j}{\partial x_i} \qquad (i, j = 1, 2, \dots, n). \tag{3} $$


If these conditions are satisfied in a certain field, the integral
calculus shows how to construct the most general function $f$
which has the required property; i.e. it shows how to integrate
the given differential expression. All the possible $f$'s differ from
one another by a constant. If we follow the procedure usual in
elementary treatises, and consider not the whole field but a suitably
restricted region round a point $x$ arbitrarily fixed in advance,
then in this region each of the $f$'s is a uniform function (i.e. one-valued,
like all the functions we are considering) of the arguments
$x_1, x_2, \dots, x_n$.
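
As a brief illustration (added here, not part of the original text), the conditions (3) can be checked by hand on two Pfaffians in two variables. The variables $x$, $y$ (written for $x_1$, $x_2$) and the two particular forms $\psi_1$, $\psi_2$ are assumptions chosen only for this example.

```latex
% For n = 2 the conditions (3) reduce to the single relation
% \partial X_1/\partial y = \partial X_2/\partial x.
\begin{align*}
\psi_1 = y\,dx + x\,dy &: \quad
  \frac{\partial X_1}{\partial y} = 1 = \frac{\partial X_2}{\partial x}
  \quad\Longrightarrow\quad \psi_1 = df, \ \ f = xy + \text{const.} \\
\psi_2 = y\,dx - x\,dy &: \quad
  \frac{\partial X_1}{\partial y} = 1 \neq -1 = \frac{\partial X_2}{\partial x}
  \quad\Longrightarrow\quad \psi_2 \text{ is not an exact differential.}
\end{align*}
```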

We now proceed to discuss a more general problem than this.
Let there be $m$ unknown functions $u$ of $n$ independent variables
$x$, and let there be given a set of relations between their differentials
which define the $du$'s in terms of the $dx$'s, in the form

$$ du_\alpha = \sum_{i=1}^{n} X_{\alpha|i}(x\,|\,u)\,dx_i \qquad (\alpha = 1, 2, \dots, m), \tag{4} $$

where the $X_{\alpha|i}$ are $mn$ arbitrarily assigned functions (finite and
continuous, together with their first derivatives).

A group of relations of the type (4) is called a system of
total differential equations;¹ equation (2) is obviously only a
particular case. It may be remarked that equation (2) is itself
equivalent to the system of $n$ equations

$$ \frac{\partial f}{\partial x_i} = X_i(x) \qquad (i = 1, 2, \dots, n), \tag{2'} $$

and that the equations (4) are analogously equivalent to the
system of $mn$ equations

$$ \frac{\partial u_\alpha}{\partial x_i} = X_{\alpha|i}(x\,|\,u) \qquad (\alpha = 1, 2, \dots, m;\ i = 1, 2, \dots, n). \tag{4'} $$

Both are problems of partial differential equations, and are
soluble only under specific conditions; but if these are satisfied,
we shall see that the integration reduces to that of ordinary
differential equations.

¹ In a system of this kind the group of variables to be considered independent
is fixed in advance. The late Professor G. Ricci in a recent work has considered
instead a system of $l$ equations of the type

$$ \sum_{s=1}^{n} \mu_{rs}(x)\,dx_s = 0 \qquad (r = 1, 2, \dots, l), $$

determining the conditions that the $n$ variables $x$ may be considered functions of
any number $p$ ($< n$) of independent variables, and indicating the steps necessary
to find the solution (cf. Atti del Reale Ist. Ven., Vol. XXXI, 1922-fi, pp. 179-183).

An account of the general theory of Pfaffian systems, with recent developments
due mainly to von Weber, Cartan, and Goursat, is given in the last-named
author’s Leçons sur le problème de Pfaff (Paris, Hermann, 1922).


2. Conditions necessary for integrability. Completely
integrable, or complete, systems.

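When the problem is stated in the form (4'), it is obvious
(from the symmetry of the second derivatives of the $u$'s) that
a necessary condition for the existence of solutions is that the
following conditions shall be satisfied:

$$ \frac{dX_{\alpha|i}}{dx_j} = \frac{dX_{\alpha|j}}{dx_i} \qquad (\alpha = 1, 2, \dots, m;\ i, j = 1, 2, \dots, n). \tag{5} $$

The symbol $d$ denoting total differentiation has been used as
a reminder that in differentiating it is necessary to take into
account that the arguments $u$ also depend on the $x$'s, i.e. that

$$ \frac{dX_{\alpha|i}}{dx_j} = \frac{\partial X_{\alpha|i}}{\partial x_j} + \sum_{\beta=1}^{m} \frac{\partial X_{\alpha|i}}{\partial u_\beta}\frac{\partial u_\beta}{\partial x_j} = \frac{\partial X_{\alpha|i}}{\partial x_j} + \sum_{\beta=1}^{m} \frac{\partial X_{\alpha|i}}{\partial u_\beta}\,X_{\beta|j}. \tag{6} $$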


Using this result, the equations (5) take the form of $\tfrac{1}{2}mn(n-1)$
relations of the type

$$ F(x\,|\,u) = 0. \tag{5'} $$

These, it will be seen, in general contain not only the $x$'s but
also the $u$'s (unlike the equations (3)); and we must suppose the
$u$'s replaced by those unknown functions of $x$ which satisfy the
given system of equations. The conditions of integrability cannot
therefore be given explicitly without knowing beforehand
the solutions of the system. This difficulty did not arise for the
equation (2), since the $X$'s, and therefore their derivatives, did
not contain the unknown function.

But it may happen, and this is the most interesting case,
that the equations (5') are not only satisfied for those particular
$u$'s which form a solution of the system, but are true identically,
i.e. for any set of values whatever of the $u$'s and of the $x$'s. In
this case, as we shall see, these conditions are not only necessary,
but also sufficient, for the integrability of the system, which is
then said to be completely integrable, or complete.
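
The distinction may be illustrated by two small systems with $m = 1$, $n = 2$. This is a sketch added here, not taken from the original; the variables $x$, $y$ stand for $x_1$, $x_2$, and the coefficients are written $X_1$, $X_2$ because there is only one unknown $u$.

```latex
% Total derivatives are formed as in (6): d/dx_j = \partial/\partial x_j + X_j\,\partial/\partial u.
\begin{align*}
&\text{(i)}\ \ du = u\,dx + u\,dy: &
\frac{dX_1}{dy} &= \frac{\partial X_1}{\partial u}\,X_2 = u, &
\frac{dX_2}{dx} &= \frac{\partial X_2}{\partial u}\,X_1 = u. \\
&\text{(ii)}\ \ du = u\,dx + x\,dy: &
\frac{dX_1}{dy} &= \frac{\partial X_1}{\partial u}\,X_2 = x, &
\frac{dX_2}{dx} &= \frac{\partial X_2}{\partial x} = 1.
\end{align*}
% (i):  the condition reads u = u, true for all x, y, u, so the system is complete
%       (its integrals are u = C e^{x+y}).
% (ii): the condition reads x - 1 = 0, which is not an identity and cannot hold
%       throughout any region of the (x, y) plane, so the system has no solution there.
```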
3. The integration of a mutually consistent system can
always be reduced to that of a complete system.

We shall now show that whenever a system of total differential
equations is integrable (in the sense that there exists at least one
set of $m$ functions $u(x_1, x_2, \dots, x_n)$ which satisfy the system),
the integration reduces to that of a complete system; we shall
thus be able to confine our subsequent discussions to systems of
the latter kind.

As we have already said, there are $\tfrac{1}{2}mn(n-1)$ conditions of
integrability (5'), while there are $m$ $u$'s. Now for $n > 2$,
$m < \tfrac{1}{2}mn(n-1)$. In general, therefore, there cannot be $m$ functions
which satisfy these conditions, and therefore the system
can certainly not admit of solutions. If exceptionally these conditions
are mutually consistent it may happen either that $m$ of
them are independent, so that there is then one single set of
values for the $u$'s which satisfies these $m$ conditions, and it only
remains to test whether these $u$'s also satisfy the given system
of equations; or that they are all satisfied identically (and then
the system is complete); or that (the most general case) they
reduce to a number $\nu < m$ of mutually consistent and independent
equations. In the latter case, $\nu$ of the unknowns can be found
in finite terms, expressed in terms of the $x$'s and the remaining
$m - \nu = \mu$ unknowns. Arranging the $u$'s in a suitable order,
we may suppose that the equations (5') give us the last $\nu$ of the
functions $u$, viz. the functions

$$ u_{\mu+1},\ u_{\mu+2},\ \dots,\ u_m, $$

in terms of the $x$'s and the remaining

$$ u_1,\ u_2,\ \dots,\ u_\mu. $$

For greater clearness, we shall denote these first $\mu$ functions
$u$ by $u'_\alpha$ $(\alpha = 1, 2, \dots, \mu)$, and the last $\nu$ by $u''_\beta$
$(\beta = 1, 2, \dots, \nu)$. Using this notation, the equations (5') can
be put in the form resolved with respect to the $u''$, namely

$$ u''_\beta = f_\beta(x\,|\,u') \qquad (\beta = 1, 2, \dots, \nu). \tag{5''} $$

Next, suppose the system of equations (4) divided into two
groups; one consisting of the first $\mu$:

$$ du'_\alpha = \sum_{i=1}^{n} X_{\alpha|i}(x\,|\,u)\,dx_i \qquad (\alpha = 1, 2, \dots, \mu); \tag{4a} $$

and the other of the remaining $\nu$:

$$ du_\alpha = \sum_{i=1}^{n} X_{\alpha|i}(x\,|\,u)\,dx_i \qquad (\alpha = \mu + 1,\ \mu + 2,\ \dots,\ m = \mu + \nu). $$

The latter group, putting $\alpha = \mu + \beta$, we shall write
in the form:

$$ du''_\beta = \sum_{i=1}^{n} X_{\mu+\beta|i}(x\,|\,u)\,dx_i \qquad (\beta = 1, 2, \dots, \nu). \tag{4b} $$

1 

Substituting from the equations (5'') and (4a), the two sides
of this last equation become linear expressions in the differentials
$dx_i$ with coefficients which depend solely on the $x$'s and the $u'$'s.
Since the coefficients on both sides must be the same (the differentials
$dx_i$ being independent), the equations (4b) reduce to
equations in finite terms, $n\nu$ in number, between the $u'$'s and
the $x$'s.

If all these reduce to identities, we need only consider the
system of equations (4a), in which the functions $u''$ are to be considered
as replaced by their values as given by the equations
(5''), so that we have a total differential system, of the same
form as the original system (4), involving only the $u'$'s, $\mu$ in
number, where $\mu = m - \nu < m$. The essential result in the
case under consideration is that the system (4a) so reduced is
necessarily complete. In fact, it consists of a part of the original
system (4) with the additional relations (5'') between the $u$'s.
The condition of integrability of the whole system (4) (where
a priori the $u$'s were treated as so many unknowns) consisted of
the equations (5), or, we may say, of the equivalent equations
(5''). For the system of equations (4a) the analogous conditions
will consist of a part of the conditions (5'') (or combinations of
these), with the proviso that every $u''$ is to be replaced by the
corresponding expression given by the equations (5'') themselves.
This process obviously leads to mere identities; hence, as stated,
the system (4a) is complete.

If on the other hand the equations (4b) give rise to non-identical
relations in finite terms between the $u'$'s and the $x$'s,
we shall have to associate them with the equations (4a) and
treat this whole system of equations in $\mu$ unknowns (including
some total differential equations and some equations in finite
terms) as we have already treated the system of equations (4)
and the conditions (5).

Proceeding in this way, we shall reach a stage where either 
the conditions are found to be mutually inconsistent, when we 
must conclude that the given system has no solution, or else the 
problem reduces to the integration of a complete system (with a 
number of unknowns which is certainly less than $m$). Q.E.D.

In consequence we shall now confine our attention solely to 
complete systems. 
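
A small example of the first alternative described in this section (the conditions determine the unknowns completely) may be helpful. It is an illustration added here under the stated choice of coefficients, not an example from the original; $x$, $y$ are written for $x_1$, $x_2$.

```latex
% m = 1, n = 2:
%   du = u\,dx + x u\,dy, \qquad X_1 = u, \quad X_2 = x u.
\begin{align*}
\frac{dX_1}{dy} &= \frac{\partial X_1}{\partial u}\,X_2 = x u, \\
\frac{dX_2}{dx} &= \frac{\partial X_2}{\partial x} + \frac{\partial X_2}{\partial u}\,X_1 = u + x u .
\end{align*}
% The condition of integrability is therefore  x u = u + x u,  i.e.  u = 0.
% This single condition (m = 1) fixes the unknown in finite terms; substituting
% u = 0 in the given equation gives 0 = 0, so u = 0 is the only solution and no
% arbitrary constant appears.
```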


4. Bilinear covariants and the resulting form for the conditions 
of complete integrability. 


We have expressed the condition of complete integrability
by means of the equations (5), which are supposed to hold for
arbitrary values of the $u$'s and of the $x$'s. We shall now express
this condition in a more concise form.

For this purpose take two different systems of infinitesimal
increments of the $x$'s, denoted by $dx_i$ and $\delta x_i$ respectively; the
corresponding increments of a generic function $u$ of the $x$'s will
then be denoted by $du$ and $\delta u$ respectively, and will be given
by

$$ du = \sum_{i=1}^{n} \frac{\partial u}{\partial x_i}\,dx_i, \qquad \delta u = \sum_{i=1}^{n} \frac{\partial u}{\partial x_i}\,\delta x_i. \tag{7} $$


Now the $dx$'s are arbitrary infinitesimals, on which we can
a priori impose any hypotheses we please; we shall consider
them as infinitesimal functions of the $x$'s. With this hypothesis
the increments of these $dx$'s, corresponding to the increments
$\delta x$ of the variables, will naturally be denoted by $\delta dx$; with a
similar interpretation for $d\delta x$. The increment $du$ will also be an
infinitesimal function of the $x$'s, and we shall thus have to consider
$\delta du$; $d\delta u$ will be similarly defined. We shall next obtain
the explicit expression of these two second differentials of $u$ in
order to show that a slight restriction on the arbitrariness of
the second differentials of the independent variables will be
sufficient to ensure the result $\delta du = d\delta u$, whatever the
function $u$ may be.

Applying the symbol of operation $\delta$ to the first of the equations
(7), we get (without any restrictive hypothesis)

$$ \delta du = \sum_{i=1}^{n}\sum_{j=1}^{n} \frac{\partial^2 u}{\partial x_i\,\partial x_j}\,dx_i\,\delta x_j + \sum_{i=1}^{n} \frac{\partial u}{\partial x_i}\,\delta dx_i. \tag{8} $$


The expression for $d\delta u$ will evidently be similar, with $d$ and
$\delta$ interchanged. Now the first part of the formula is unaltered
by this interchange, while in the second $\delta dx_i$ is replaced by
$d\delta x_i$. If therefore we impose on the arbitrary functions $dx$ and
$\delta x$ of the $x$'s the condition

$$ d\delta x_i = \delta dx_i \qquad (i = 1, 2, \dots, n), \tag{9} $$

which represents a very small loss of generality, the second part
of the formula (8) will also be unaltered when $d$ and $\delta$ are interchanged;
we shall therefore have, for any function whatever
$u(x_1, x_2, \dots, x_n)$,

$$ d\delta u = \delta du. \tag{10} $$


It may be noted incidentally that in the differential calculus
it is usual to impose a hypothesis involving considerably greater
restrictions than the conditions (9); the usual convention is
that the second differentials of the independent variables are
zero, or that the $dx$'s are not functions of the $x$'s, but constants.

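We shall now consider, along with the increments of the
independent variables, not a function with its differentials,
but a generic Pfaffian

$$ \psi_d = \sum_{i=1}^{n} X_i\,dx_i, $$

in which the $X$'s are given functions of the $x$'s.

The suffix $d$ has been inserted as a reminder that the Pfaffian
refers to the increments $dx_i$; the same Pfaffian relative to the
increments $\delta x_i$ will be conveniently distinguished by the analogous
notation

$$ \psi_\delta = \sum_{i=1}^{n} X_i\,\delta x_i. $$

Both $\psi_d$ and $\psi_\delta$ will naturally be functions of the $x$'s. Calculating
$\delta\psi_d$ we thus get

$$ \delta\psi_d = \sum_{i=1}^{n} \delta X_i\,dx_i + \sum_{i=1}^{n} X_i\,\delta dx_i = \sum_{i=1}^{n}\sum_{j=1}^{n} \frac{\partial X_i}{\partial x_j}\,dx_i\,\delta x_j + \sum_{i=1}^{n} X_i\,\delta dx_i; $$

or with the abridged notation which can be used when several
summations between the same limits are applied to the same
general term,

$$ \delta\psi_d = \sum_{i,j=1}^{n} \frac{\partial X_i}{\partial x_j}\,dx_i\,\delta x_j + \sum_{i=1}^{n} X_i\,\delta dx_i. $$

Interchanging $d$ and $\delta$ we get $d\psi_\delta$. Using the relation (9),
the difference $\delta\psi_d - d\psi_\delta$ reduces to

$$ \sum_{i,j=1}^{n} \frac{\partial X_i}{\partial x_j}\,dx_i\,\delta x_j - \sum_{i,j=1}^{n} \frac{\partial X_i}{\partial x_j}\,\delta x_i\,dx_j. $$

But the value of a sum is plainly unaffected by the particular
letters of the alphabet which we choose to assign to the
suffixes with respect to which the summation is to be made.
We may therefore interchange $i$ and $j$ in the second part of the
preceding formula, so that we can now write the equation in the
form

$$ \delta\psi_d - d\psi_\delta = \sum_{i,j=1}^{n} \left(\frac{\partial X_i}{\partial x_j} - \frac{\partial X_j}{\partial x_i}\right) dx_i\,\delta x_j. \tag{11} $$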


The expression $\delta\psi_d - d\psi_\delta$ is called the bilinear covariant
relative to the given Pfaffian. The use of the term “bilinear”
is sufficiently justified by the expression just found, which is
linear in the arguments $dx$ and also in the arguments $\delta x$. The
name “covariant” is due to the circumstance that the numerical
value and formal structure of the two sides of equation (11)
always remain the same when the independent variables $x$ vary
in any way whatever. But we shall return to this point farther
on (cf. Chapter VI, p. 144) in connexion with the general idea of
invariants (functions or differential forms).
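
As an illustration of formula (11), the bilinear covariant can be computed directly in two variables. The example is added here and is not in the original; the two Pfaffians in $x$, $y$ (written for $x_1$, $x_2$) are chosen only for the purpose.

```latex
% n = 2: the double sum in (11) has only the terms (i, j) = (1, 2) and (2, 1), so
%   \delta\psi_d - d\psi_\delta
%     = (\partial X_1/\partial y - \partial X_2/\partial x)\,(dx\,\delta y - dy\,\delta x).
\begin{align*}
\psi_d = y\,dx - x\,dy &: \quad
\delta\psi_d - d\psi_\delta = \bigl(1 - (-1)\bigr)\,(dx\,\delta y - dy\,\delta x)
 = 2\,(dx\,\delta y - dy\,\delta x), \\
\psi_d = y\,dx + x\,dy &: \quad
\delta\psi_d - d\psi_\delta = (1 - 1)\,(dx\,\delta y - dy\,\delta x) = 0 .
\end{align*}
```

The second form is the exact differential $d(xy)$, in agreement with the fact that the bilinear covariant of an exact differential vanishes.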

Meanwhile it may be noted that if the Pfaffian is an exact
differential, i.e. if the conditions (3) are satisfied, the right-hand
side of equation (11) becomes zero, and we reach a result which
has already been found (cf. formula (10)).

We may now return to the examination of the system of
equations (4), and the conditions of complete integrability. Consider
the $m$ Pfaffians which constitute the right-hand sides of
the equations (4):

$$ \psi_d^{(\alpha)} = \sum_{i=1}^{n} X_{\alpha|i}\,dx_i \qquad (\alpha = 1, 2, \dots, m), $$

and construct their bilinear covariants. We shall show that the
two conditions: (a) that these covariants vanish identically,
however $dx$ and $\delta x$ are chosen; and (b) that the equations (5)
are identically true whatever values are assigned to the $u$'s,
are completely equivalent, so that the condition of complete
integrability may be written in the form

$$ \delta\psi_d^{(\alpha)} - d\psi_\delta^{(\alpha)} = 0 \qquad (\alpha = 1, 2, \dots, m), \tag{12} $$

it being understood that this equation must hold for arbitrary
values of the increments $dx$ and $\delta x$.¹

To prove this, take the explicit expression of these bilinear
covariants, in the form given by equation (11). In differentiating
it must be remembered that the $X$'s must be considered as
functions of the $x$'s, both directly, and also indirectly as functions
of the $u$'s. Using the convention already adopted, the derivatives
can therefore be denoted by the symbol for total differentiation;
equation (12) thus becomes

$$ \sum_{i,j=1}^{n} \left(\frac{dX_{\alpha|i}}{dx_j} - \frac{dX_{\alpha|j}}{dx_i}\right) dx_i\,\delta x_j = 0. \tag{12'} $$

Now if the conditions (5) for complete integrability are satisfied,
the coefficients of this bilinear form (i.e. the expressions in parentheses
in equation (12')) are all zero, and therefore the equation
is satisfied however the $dx$'s and $\delta x$'s are chosen. Vice versa,
suppose that the equation is satisfied however the $dx$'s and $\delta x$'s
are chosen. Then all the coefficients must necessarily be zero.
For if we take all the $dx$'s and $\delta x$'s as zero, except one pair,
e.g. $dx_i$, $\delta x_j$, where $i$, $j$ are two arbitrarily chosen but definite
integers of the series $1, 2, \dots, n$, then the sum in equation (12')
reduces to the single term

$$ \left(\frac{dX_{\alpha|i}}{dx_j} - \frac{dX_{\alpha|j}}{dx_i}\right) dx_i\,\delta x_j, $$

which cannot vanish unless

$$ \frac{dX_{\alpha|i}}{dx_j} - \frac{dX_{\alpha|j}}{dx_i} = 0. $$

We therefore conclude that the conditions (5) can be written in
the more concise form (12).

¹ As a matter of fact we have imposed the restrictions (9) on the second
differentials $\delta dx_i$, $d\delta x_i$, but the infinitesimal increments $dx_i$, $\delta x_i$ to be assigned to
the $x_i$'s at the generic point under consideration are still entirely arbitrary.


5. Morera’s method of integration.¹

We shall now show that the conditions of complete integrability
are sufficient for integrability, or more precisely that if they are
satisfied there exists one and only one set of $m$ functions $u(x)$
which satisfy the given system of equations and have values
arbitrarily fixed in advance at a point also fixed in advance.
Considering these initial values of $u$ as arbitrary constants (as
evidently they may be considered to be), we can say more
shortly that the general integral depends on $m$ arbitrary constants,
or that there are $\infty^m$ integrals.

For the proof, we first fix a generic point $P_0(x_1^0, x_2^0, \dots, x_n^0)$
in the field of variation of the $x$'s in which the $X$'s are defined.
Let $P_1(x_1^1, x_2^1, \dots, x_n^1)$ be another arbitrary point in the field, and
suppose it joined to $P_0$ by a line $T$ which does not leave the
field. $T$ will be defined by parametric equations

$$ x_i = \varphi_i(t) \qquad (i = 1, 2, \dots, n), \tag{13} $$

where $t$ is a parameter which has the value $t_0$ at $P_0$ and the value
$t_1$ at $P_1$. We shall provisionally confine our investigation to the
points of this line, so that for the present any functions $u$ of the
$x$'s are to be considered as functions of the variable $t$ alone (via
the $x$'s and the equations (13)). Their derivatives will be

$$ \frac{du_\alpha}{dt} = \sum_{i=1}^{n} \frac{\partial u_\alpha}{\partial x_i}\frac{dx_i}{dt} \qquad (\alpha = 1, 2, \dots, m), $$

or, denoting differentiation with respect to $t$ by a dot, and substituting
from equation (4'),

$$ \frac{du_\alpha}{dt} = \sum_{i=1}^{n} X_{\alpha|i}\,\dot{x}_i, \tag{14} $$

or, more concisely, writing $\psi_d^{(\alpha)} = \sum_{i} X_{\alpha|i}\,dx_i$ as in § 4,

$$ \frac{du_\alpha}{dt} = \frac{\psi_d^{(\alpha)}}{dt} \qquad (\alpha = 1, 2, \dots, m). \tag{14'} $$

¹ “Zur Integration der vollständigen Differentiale”, in Math. Ann., Vol. 27,
1886, pp. 403-411. Cf. also Severi: “Sul metodo di Mayer per l'integrazione
delle equazioni lineari ai differenziali totali”, in Atti del R. Ist. Veneto, Vol.
LXIX, 1910, pp. 419-426.


The $x_i$'s are known functions of $t$ given by equations (13);
hence the equations (14') are of the type

$$ \frac{du_\alpha}{dt} = U_\alpha(t\,|\,u_1, u_2, \dots, u_m) \qquad (\alpha = 1, 2, \dots, m), \tag{14''} $$

i.e. they form a system of ordinary differential equations, in the
normal form. Now given $m$ arbitrary constants $u_1^0, u_2^0, \dots, u_m^0$,
it is known from the calculus that (subject to qualitative conditions
of continuity and existence of derivatives, which we suppose
satisfied) there exist $m$ functions $u_\alpha(t)$ which satisfy the system
(14''), and which are equal to the given constants when $t = t_0$.
If, therefore, the $u$'s are given any arbitrary set of values at $P_0$,
they are defined at all points of the line $T$, and therefore also
at $P_1$. It may however happen (and does in general) that if
the points $P_0$ and $P_1$ are joined by another line instead of $T$,
different values will be found for the $u$'s at $P_1$. But we shall
now show that if the conditions of complete integrability are
satisfied, the values of the $u$'s at $P_1$, found by the method just
described, are independent of the line $T$, so that these $u$'s will
be functions only of the co-ordinates of $P_1$, that is, functions of
position; they will satisfy the given system of equations not
only along a line, but along all the infinite number of lines which
can be drawn in the given field, or, in other words, in the whole
of this field. They will therefore constitute the required solutions
of the total differential system (4), as we shall show later on.

We shall simplify our task by considering infinitesimal displacements;
i.e. by showing in the first place that the values of
the $u$'s at $P_1$ remain unaltered if the line $T$ undergoes an infinitesimal
deformation; it will follow that they will be the same for
any line which can be obtained from $T$ by a succession of infinitesimal
deformations, i.e. by a continuous deformation of $T$.
If then we suppose the field such that every line joining $P_0$ and
$P_1$ can be obtained in this way, we shall have all that is required.
Such fields (e.g. a triangle or a circle in a plane, a cube or a sphere
in space) are called simply connected.

Consider therefore a line $T'$ infinitely close to $T$; we may
think of it as obtained by displacing each point $P$ of $T$, of co-ordinates
$x_i$, to a point $P'$ of co-ordinates $x_i + \delta x_i$, and the
infinitesimal increment $\delta x_i$ may be taken in the form $\epsilon\chi_i$, for
example, where every $\chi_i$ is a finite quantity varying from point
to point of the curve (and therefore a function of $t$), and $\epsilon$ is
an infinitesimal factor taken as constant, and therefore independent
of $t$. With these conditions the parametric equations of
the curve $T'$ will be

$$ x_i + \delta x_i = \varphi_i(t) + \epsilon\chi_i(t) \qquad (i = 1, 2, \dots, n). \tag{15} $$

The functions $\chi_i$ may be considered as arbitrary, except for
the condition of vanishing for $t = t_0$ and for $t = t_1$, so that the
lines $T$ and $T'$ may have the same extremities. We shall adopt
the natural convention of using the operator $\delta$ to denote the increment
of a generic quantity (scalar or vector) in passing from the
point $P$ of $T$ to the corresponding point $P'$ of $T'$.

Now suppose the equations (14'') integrated along $T'$; we
shall get functions $u_\alpha + \delta u_\alpha$ of $t$, satisfying the equations

$$ \frac{d}{dt}\,(u_\alpha + \delta u_\alpha) = \frac{\psi_d^{(\alpha)} + \delta\psi_d^{(\alpha)}}{dt}, $$

or

$$ \frac{d\,\delta u_\alpha}{dt} = \frac{\delta\psi_d^{(\alpha)}}{dt} \qquad (\alpha = 1, 2, \dots, m); $$

using hypothesis (12), expressing the complete integrability of
the system, we can also write the equations in the form

$$ \frac{d\,\delta u_\alpha}{dt} = \frac{d\psi_\delta^{(\alpha)}}{dt}. \tag{16} $$

From the theorem of the existence of integrals of ordinary
differential systems (already referred to in connexion with
equations (14')), it follows that the quantities $\delta u_\alpha$ are uniquely
determined by these equations together with the condition of
vanishing at $P_0$. Now the equations (16) are obviously satisfied
by taking

$$ \delta u_\alpha = \psi_\delta^{(\alpha)} = \sum_{i=1}^{n} X_{\alpha|i}\,\delta x_i \tag{17} $$

(i.e. assuming for the quantities $\delta u_\alpha$ the expressions appropriate
to the case where the $u$'s are in fact functions of the $x$'s). These
expressions vanish with the $\delta x_i$'s, i.e. at $P_0$ (so satisfying the
initial conditions which, together with the equations (16), determine
them uniquely), and also at $P_1$, which proves the required
result.

It is thus proved that in order to construct the functions
$u$ whose total differentials are the assigned Pfaffians (satisfying
identically the equations (12) or the original equations (5))
and which have given values at a given point $P_0$, we need
only join $P_0$ to any point $P_1$ by any line $T$, and integrate the
system of ordinary differential equations along $T$.

To complete the proof, we must now show that the differentials
of the functions of the co-ordinates of $P_1$ obtained in this way
are in fact the Pfaffians $\psi_d^{(\alpha)}$. Consider a point $P_2$ infinitesimally
close to $P_1$; to construct the values of the $u$'s at $P_2$ we may use the
broken line made up of $T$ and the small segment $P_1P_2$. It is
then obvious that integrating the equations (14) along this line
we get, in passing from $P_1$ to $P_2$, the increment $du_\alpha = \psi_d^{(\alpha)}$.
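
The construction of this section can be tried numerically. The sketch below is an illustration added here, not the author's or Morera's procedure in code: it integrates the ordinary system obtained by restricting the total differential equations to a line, using a classical fourth-order Runge-Kutta step, along two different lines joining the same end points. The function name `integrate_along`, the two sample systems (one complete, one not) and the two paths are all assumptions chosen for the example.

```python
import math


def integrate_along(path, dpath, X, u0, steps=2000):
    """Integrate du/dt = sum_i X_i(x(t), u) * dx_i/dt over 0 <= t <= 1 (RK4)."""
    def rhs(t, u):
        x, dx = path(t), dpath(t)
        return sum(Xi * dxi for Xi, dxi in zip(X(x, u), dx))

    h = 1.0 / steps
    u = u0
    for k in range(steps):
        t = k * h
        k1 = rhs(t, u)
        k2 = rhs(t + h / 2, u + h * k1 / 2)
        k3 = rhs(t + h / 2, u + h * k2 / 2)
        k4 = rhs(t + h, u + h * k3)
        u += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return u


# Two lines joining P0 = (0, 0) to P1 = (1, 1), each given as (x(t), dx/dt).
straight = (lambda t: (t, t), lambda t: (1.0, 1.0))
curved = (lambda t: (t, t * t), lambda t: (1.0, 2.0 * t))


def complete(x, u):
    # du = u dx + u dy: the integrability condition holds identically.
    return (u, u)


def incomplete(x, u):
    # du = u dx + x dy: the integrability condition reduces to x = 1.
    return (u, x[0])


a = integrate_along(*straight, complete, 1.0)
b = integrate_along(*curved, complete, 1.0)
c = integrate_along(*straight, incomplete, 1.0)
d = integrate_along(*curved, incomplete, 1.0)

print("complete system:  ", a, b, "exact:", math.exp(2))
print("incomplete system:", c, d)
```

For the complete system both lines lead to the same value at $P_1$, namely $e^2$, the value of $e^{x+y}$ at $(1, 1)$. For the second system the two lines give $2e - 2$ and $5e - 10$ respectively, so the result depends on the path and no function of position is obtained.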

6. Note on Mayer's method. 

The method followed in the preceding section to show the 
existence of the integrals of a complete system of total differential 
equations, is due to Morera. 

There was an earlier method, proposed by Mayer, by no
means so clear, and seemingly dependent on a purely formal
device. Morera’s method, which is inspired by geometrical
intuition, brings out the true reason for the success of Mayer's
device, and provides a criterion for its validity.

Mayer’s method is to join the points $P_0$ and $P_1$ by a segment
of a straight line, instead of by any line $T$, so giving the equations
(13) the form

$$ x_i = x_i^0 + t\,(x_i^1 - x_i^0) \qquad (i = 1, 2, \dots, n); $$

the proof consists of a series of purely algebraic operations,
instead of the proof developed above almost without calculations.
In addition, while Morera’s method can be applied if we merely
suppose that the field in which the given equations hold is simply
connected, Mayer’s method, on the contrary, obviously requires
a much more restrictive hypothesis, namely, that any two points
in the field can be joined by a straight line which lies wholly in
the field. This property is expressed by saying that the field is
convex.
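
A worked instance of Mayer's choice of path may make the procedure concrete. It is an illustration added here; the system, the base point $P_0 = (0, 0)$ and the initial value $u_0$ are assumptions of the example, with $x$, $y$ written for $x_1$, $x_2$.

```latex
% System (m = 1, n = 2):  du = u\,dx + u\,dy, which is complete,
% since dX_1/dy = u = dX_2/dx identically.
% Straight segment from P_0 = (0, 0) to P_1 = (x^1, y^1):
%   x = t\,x^1, \quad y = t\,y^1, \quad 0 \le t \le 1.
\begin{align*}
\frac{du}{dt} &= X_1\,\dot{x} + X_2\,\dot{y} = (x^1 + y^1)\,u, \qquad u(0) = u_0, \\
u(t) &= u_0\,e^{(x^1 + y^1)\,t}, \qquad u(P_1) = u(1) = u_0\,e^{x^1 + y^1}.
\end{align*}
% Dropping the superscripts, u(x, y) = u_0\,e^{x + y}, and indeed
% \partial u/\partial x = u = X_1, \quad \partial u/\partial y = u = X_2.
```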


7. Application. 

Given a generic Pfaffian

$$ \psi = \sum_{i=1}^{n} X_i(x)\,dx_i, $$

we shall investigate whether it is possible to find a relation between
the $x$'s of the type

$$ f(x_1, x_2, \dots, x_n) = C \qquad (C \text{ constant}), \tag{18} $$

which shall be an integral of the equation

$$ \psi = \sum_{i=1}^{n} X_i\,dx_i = 0, \tag{19} $$

in the sense that the relation produced by differentiating equation
(18), namely,

$$ df = \sum_{i=1}^{n} \frac{\partial f}{\partial x_i}\,dx_i = 0, \tag{18'} $$

is equivalent to the equation (19).

For this it is plainly both necessary and sufficient that the
derivatives of the unknown function $f$ should be proportional
to the given functions $X_i$. We therefore need some test to apply
to the $X$'s themselves which will show whether they are proportional
to the derivatives of a single function not known in
advance.

This problem, which also occurs in geometrical questions (as
we shall see in particular on pp. 263-265), reduces at once to a
particular case of a total differential system. In fact, given that
$\psi$ does not vanish identically, and therefore has at least one of
its coefficients not equal to zero, we may legitimately suppose
that $X_n$ does not vanish identically. We can thus write equation
(19) in the form

$$ dx_n = -\sum_{i=1}^{n-1} \frac{X_i}{X_n}\,dx_i. \tag{19'} $$

In order that this may be equivalent to the equation (18'),
we must have $\partial f/\partial x_n \neq 0$ in the latter. From this condition it follows
that the equation in finite terms (18) defines a function

$$ x_n = \varphi(x_1, x_2, \dots, x_{n-1}, C), \tag{18''} $$

which makes equation (18) an identity, and therefore also equation
(18'), as well as the equivalent equations (19) and (19'). This
last equation is evidently a particular case of systems of the
type (4) consisting of one equation and one unknown function $x_n$;
it must therefore be completely integrable, having as integral
the function given by formula (18''), which depends on the arbitrary
constant $C$. Reciprocally, if (19') is completely integrable,
then there will be a solution (18'') depending on an arbitrary
constant $C$; solving with respect to $C$, this becomes an integral
relation of the desired form (18). The problem therefore reduces
to expressing the completeness of equation (19').

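Applying formula (5), the required conditions of completeness
are

$$ \frac{d}{dx_j}\,\frac{X_i}{X_n} = \frac{d}{dx_i}\,\frac{X_j}{X_n} \qquad (i, j = 1, 2, \dots, n-1;\ i \neq j). $$

Expanding the derivatives, these relations are easily put in
the following form:

$$ X_n\left(\frac{\partial X_i}{\partial x_j} - \frac{\partial X_j}{\partial x_i}\right) + X_i\left(\frac{\partial X_j}{\partial x_n} - \frac{\partial X_n}{\partial x_j}\right) + X_j\left(\frac{\partial X_n}{\partial x_i} - \frac{\partial X_i}{\partial x_n}\right) = 0 \qquad (i, j = 1, 2, \dots, n-1;\ i \neq j). \tag{20} $$

Introduce for the moment the restriction that all the other
functions $X$, as well as $X_n$, are different from zero. We can then
write

$$ p_{rs} = \frac{1}{X_r X_s}\left(\frac{\partial X_r}{\partial x_s} - \frac{\partial X_s}{\partial x_r}\right) \qquad (r, s = 1, 2, \dots, n), \tag{21} $$

whatever $r$ and $s$ may be, so that the conditions of integrability
take the more concise form

$$ p_{ij} + p_{jn} + p_{ni} = 0 \qquad (i, j = 1, 2, \dots, n-1;\ i \neq j). \tag{22} $$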

The conditions (22) are $\tfrac{1}{2}(n-1)(n-2)$ in number, this being
the number of ways of choosing two distinct integers, $i$, $j$, from
the series $1, 2, \dots, n-1$. They represent all the conditions of
integrability. Now the choice of the variable to be expressed
as a function of the remaining $x$'s was arbitrary (subject only to
the condition $X_n \neq 0$); hence in general the relations

$$ p_{ij} + p_{jk} + p_{ki} = 0 \tag{22'} $$

must be satisfied, where $i$, $j$, $k$ are any three integers, no two of
which are the same, chosen from the series $1, 2, \dots, n$. Such
a triplet can be chosen in $\tfrac{1}{6}n(n-1)(n-2)$ ways; this is therefore
the number of relations of the form (22'). But these are of course
not all independent, since the conditions (22) (which form only
part of (22')) are sufficient for the complete integrability of the
expression under consideration. In fact, it is easy to show directly
that only $\tfrac{1}{2}(n-1)(n-2)$ of the equations (22'), e.g. those given
by formula (22), are essential, the others reducing to algebraic
deductions from them.

This can be shown by means of the following lemma, which
holds whatever the terms $p_{ik}$ may be. If $p_{ik}$ $(i, k = 1, 2, \dots, n)$
is a double skew (or antisymmetrical) system,¹ and if for some
fixed suffix $a$ the cyclic relation

$$ p_{ik} + p_{ka} + p_{ai} = 0 $$

is true for every pair of suffixes $i$, $k$, then this relation is also
true for any three suffixes $i$, $k$, $l$.

To prove this, take the corresponding relations for the pairs
$k$, $l$ and $l$, $i$,

$$ p_{kl} + p_{la} + p_{ak} = 0, $$

$$ p_{li} + p_{ia} + p_{al} = 0. $$

Adding these to the first relation, and remembering the condition of skewness

$$ p_{ka} + p_{ak} = 0 $$

(and similarly for the other pairs of suffixes), there remains

$$ p_{ik} + p_{kl} + p_{li} = 0. \qquad \text{Q.E.D.} $$

Substituting in equations (22') the values of the $p$'s given by
formula (21), and multiplying by $X_i X_j X_k$ so as to clear of
fractions, we get the equations of condition

$$ X_i\left(\frac{\partial X_j}{\partial x_k} - \frac{\partial X_k}{\partial x_j}\right) + X_j\left(\frac{\partial X_k}{\partial x_i} - \frac{\partial X_i}{\partial x_k}\right) + X_k\left(\frac{\partial X_i}{\partial x_j} - \frac{\partial X_j}{\partial x_i}\right) = 0 \qquad (i, j, k = 1, 2, \dots, n). \tag{23} $$

¹ I.e. a system of numbers such that a one-to-one correspondence, by a given
law, exists between them and the pairs of integers $i$, $k$ $(= 1, 2, \dots, n)$, and such
that $p_{ik} = -p_{ki}$ for any pair of indices whatever.


We thus find this whole set of equations as a necessary consequence
of that group of them (say, the group (20)) in which
one of the suffixes is fixed, with the further condition that none
of the $X$'s vanish. This last condition was applied at the point
where we divided by the product of the $X$'s; it is, however, not
essential, and can ultimately be discarded, as we shall now show.
In fact, the equations (23) being necessary consequences of the
equations (20) for any non-zero values of the $X$'s, however
small, and being integral in the $X$'s and their derivatives, it
follows that we may pass to the limit when any one of the $X$'s
tends to zero. We therefore have, for all values of the $X$'s, that
the equations (23), or a group of them of the type (20), constitute
the necessary and sufficient conditions for the complete
integrability of the equations (19), or, in other words, the condition
that the $n$ functions $X_i(x_1, x_2, \dots, x_n)$ may be proportional
to the derivatives of a single function.
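
For $n = 3$ the conditions (23) reduce to a single relation, which can be checked directly. The two Pfaffians below, in variables $x, y, z$ written for $x_1, x_2, x_3$, are examples added here for illustration and are not taken from the original.

```latex
% n = 3, (i, j, k) = (1, 2, 3): the condition is
%   X_1(\partial_z X_2 - \partial_y X_3) + X_2(\partial_x X_3 - \partial_z X_1)
%     + X_3(\partial_y X_1 - \partial_x X_2) = 0.
\begin{align*}
\text{(i)}\ \ \psi = y\,dx - x\,dy,\quad (X_1, X_2, X_3) = (y, -x, 0) &: \quad
  y\,(0 - 0) - x\,(0 - 0) + 0\cdot(1 + 1) = 0 . \\
\text{(ii)}\ \ \psi = dz - y\,dx,\quad (X_1, X_2, X_3) = (-y, 0, 1) &: \quad
  -y\,(0 - 0) + 0\cdot(0 - 0) + 1\cdot(-1 - 0) = -1 \neq 0 .
\end{align*}
% (i):  the condition holds, and an integral is f = x/y = C (for y \neq 0), since
%       df = (y\,dx - x\,dy)/y^2 is proportional to \psi.
% (ii): the condition fails, so no relation f(x, y, z) = C has \psi = 0 as its
%       differential.
```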


8. Mixed systems of equations. 

In certain problems we have to deal with mixed systems, i.e.
those containing some total differential equations and some
equations in finite terms:

$$ du_\alpha = \sum_{i=1}^{n} X_{\alpha|i}(x\,|\,u)\,dx_i \qquad (\alpha = 1, 2, \dots, m), \tag{4} $$

$$ F_k(x\,|\,u) = 0 \qquad (k = 1, 2, \dots, \nu). \tag{24} $$

The discussion is essentially the same as in § 3 (p. 10). But
we propose to go through it again in order to obtain, in a form
suitable for use in concrete cases, the condition of complete
integrability of a mixed system of the type (4), (24).

It is obvious in the first place that a necessary condition for
the existence of solutions is that the equations (24) (which we
shall suppose mutually consistent and independent) are not more
in number than $m$, the number of the unknowns $u$. If there were
exactly $m$ of them, they would completely determine the $u$'s,
and we should only have to examine whether the $u$'s satisfy
the equations (4). We shall therefore suppose

$$ \nu < m, $$

and shall imagine the equations (24) solved with respect to $\nu$
of the $u$'s, which will thus be expressed in terms of the $x$'s and
the remaining $m - \nu = \mu$ unknowns $u$.

As on p. 16, we shall call the two groups of $u$'s respectively
$u''_\beta$ $(\beta = 1, 2, \dots, \nu)$ and $u'_\alpha$ $(\alpha = 1, 2, \dots, \mu)$, so that the equations
(24) may be written (cf. equations (5'')) in the form

$$ u''_\beta = f_\beta(x\,|\,u') \qquad (\beta = 1, 2, \dots, \nu). \tag{24'} $$

Corresponding to this division of the $u$'s into two groups it
will be convenient to divide the equations (4) into two groups
(4a) and (4b) (as was done on p. 16), which we repeat here for the
reader’s convenience:

$$ du'_\alpha = \sum_{i=1}^{n} X_{\alpha|i}(x\,|\,u)\,dx_i \qquad (\alpha = 1, 2, \dots, \mu), \tag{4a} $$

$$ du''_\beta = \sum_{i=1}^{n} X_{\mu+\beta|i}(x\,|\,u)\,dx_i \qquad (\beta = 1, 2, \dots, \nu). \tag{4b} $$

We now propose to show that the given mixed system is
completely integrable (and it will be called complete) if the
following conditions are satisfied:

(a) The conditions (5) for the complete integrability of the
equations (4) are satisfied when after differentiation the values
of $u''$ given by equations (24') are substituted in them; they
need not in general hold when any arbitrary functions are taken
for the $u''$'s;

(b) When the functions $u''$ are replaced by their values as
given by equations (24'), the equations (4b) must be identical
with the equations obtained by differentiating the equations
(24'); or more concisely, the equations (4b) must reduce to
identities on substituting from equations (24').

We shall show that if the mixed system is complete, in the
sense now considered, then the equations (4a), when the $u''$'s
in them are expressed in terms of the $u'$'s and the x's by means
of equations (24'), constitute a completely integrable system of
$\mu$ total differential equations in $\mu$ unknowns; the $u'$'s can
therefore be obtained from them, and hence, by equations (24'), the
$u''$'s; by hypothesis (b) above, the equations (4b) will thus be
satisfied. Hence the problem will be solved and its general integral
(p. 22) will contain $\mu = m - \nu$ arbitrary constants.

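To simplify the formulae, we shall agree that if

$$\Omega(x\,|\,u)$$

is any function whatever of the x's and the u's, then

$$[\Omega](x\,|\,u')$$

will denote the same function when the $u''$'s are replaced by the
expressions (24'). We shall obviously have, the derivatives of the
$u''$'s being formed from the expressions (24'),

$$\frac{\partial[\Omega]}{\partial x_i} = \left[\frac{\partial\Omega}{\partial x_i}\right] + \sum_{\beta=1}^{\nu}\left[\frac{\partial\Omega}{\partial u''_\beta}\right]\frac{\partial u''_\beta}{\partial x_i}, \tag{25}$$

$$\frac{\partial[\Omega]}{\partial u'_\alpha} = \left[\frac{\partial\Omega}{\partial u'_\alpha}\right] + \sum_{\beta=1}^{\nu}\left[\frac{\partial\Omega}{\partial u''_\beta}\right]\frac{\partial u''_\beta}{\partial u'_\alpha}. \tag{26}$$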


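With this convention, we can write hypotheses (a) and (b)
respectively in the forms

$$\left[\frac{\partial X_{\alpha|i}}{\partial x_j}\right] + \sum_{\gamma=1}^{m}\left[\frac{\partial X_{\alpha|i}}{\partial u_\gamma}\right][X_{\gamma|j}] = \left[\frac{\partial X_{\alpha|j}}{\partial x_i}\right] + \sum_{\gamma=1}^{m}\left[\frac{\partial X_{\alpha|j}}{\partial u_\gamma}\right][X_{\gamma|i}] \qquad (i, j = 1, 2, \dots n;\ \alpha = 1, 2, \dots \mu), \tag{27}$$

$$[X_{\mu+\beta|i}] = \frac{\partial u''_\beta}{\partial x_i} + \sum_{\alpha=1}^{\mu}\frac{\partial u''_\beta}{\partial u'_\alpha}[X_{\alpha|i}] \qquad (i = 1, 2, \dots n;\ \beta = 1, 2, \dots \nu), \tag{28}$$

where in (28) the derivatives of the $u''$'s are formed from the
expressions (24').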

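We have therefore to examine the conditions of complete
integrability of equations (4a), which will be

$$\frac{d[X_{\alpha|i}]}{dx_j} = \frac{d[X_{\alpha|j}]}{dx_i} \qquad (i, j = 1, 2, \dots n;\ \alpha = 1, 2, \dots \mu), \tag{29}$$

and we have to show that they are satisfied identically.
Let us transform the left-hand side of (29) by first writing out
in full the result of applying the operator $\frac{d}{dx_j}$ to a function of
the x's and the $u'$'s. We shall get

$$\frac{d[X_{\alpha|i}]}{dx_j} = \frac{\partial[X_{\alpha|i}]}{\partial x_j} + \sum_{\gamma=1}^{\mu}\frac{\partial[X_{\alpha|i}]}{\partial u'_\gamma}[X_{\gamma|j}],$$

or, using formulas (25) and (26),

$$\left[\frac{\partial X_{\alpha|i}}{\partial x_j}\right] + \sum_{\beta=1}^{\nu}\left[\frac{\partial X_{\alpha|i}}{\partial u''_\beta}\right]\frac{\partial u''_\beta}{\partial x_j} + \sum_{\gamma=1}^{\mu}\left\{\left[\frac{\partial X_{\alpha|i}}{\partial u'_\gamma}\right] + \sum_{\beta=1}^{\nu}\left[\frac{\partial X_{\alpha|i}}{\partial u''_\beta}\right]\frac{\partial u''_\beta}{\partial u'_\gamma}\right\}[X_{\gamma|j}],$$

and finally, using (28),

$$\left[\frac{\partial X_{\alpha|i}}{\partial x_j}\right] + \sum_{\gamma=1}^{\mu}\left[\frac{\partial X_{\alpha|i}}{\partial u'_\gamma}\right][X_{\gamma|j}] + \sum_{\beta=1}^{\nu}\left[\frac{\partial X_{\alpha|i}}{\partial u''_\beta}\right][X_{\mu+\beta|j}].$$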


Remembering that the m arguments u consist of the two
groups $u'$ and $u''$, it will at once be seen that this is merely the
left-hand side of (27). Interchanging i and j, the right-hand side
of (29) similarly is seen to be identical with the right-hand side
of (27); equations (27) being supposed to hold, it follows that the
equations (29) are satisfied identically.

It follows that the integration of a complete mixed system of
the type (4), (24), reduces to that of a complete (and therefore
integrable) total differential system in $\mu$ unknowns. The general
integral therefore contains $\mu = m - \nu$ arbitrary constants.

If the mixed system is not complete, i.e. if the conditions
(a) and (b) are not satisfied without further restrictions, then
discussion on the lines of § 3 (p. 16) obviously shows that we must
add to the equations (24) so many of the conditions (a) and (b)
as do not reduce to identities in virtue of equations (24), since
the equations (12) must hold whenever a set of m integrals
exists. Repeating the same procedure, we reach either an
inconsistency, showing that equations (4), (24), can have no solutions,
or else a complete system with less than $\mu$ unknowns. In the
latter case the number of constants in the general integral is also
less than $\mu$.

A particular result of the foregoing discussion is that if $\nu$
independent equations in finite terms are associated with a system
of total differential equations in m functions, the differential
system being itself complete, then in the most favourable case
(i.e. when the combined system is also complete) the number of
constants in the integral is lowered by $\nu$ units, from m to $m - \nu$.

In general (i.e. when the mixed system is not complete) the
integrals, if they exist, certainly contain less than $m - \nu$
constants.
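
The counting of constants may be illustrated by a small example, which is not taken from the original text: the particular equations below are assumed purely for convenience. With n = 1, m = 2, $\nu$ = 1 the integrability conditions (a) are vacuous (there is only one independent variable), so that only condition (b) has to be examined; the sketch checks it and exhibits the single constant $m - \nu = 1$ of the general integral.

```latex
% Illustrative mixed system (an assumed example, not from the text).
\begin{align*}
&\text{Differential equations:} && du_1 = u_2\,dx, \qquad du_2 = -u_1\,dx; \\
&\text{Finite equation:} && u_1^2 + u_2^2 - 1 = 0, \quad\text{i.e. } u_2 = \sqrt{1 - u_1^2}
   \ \ (\text{branch } u_2 > 0); \\
&\text{Condition (b):} && d\sqrt{1 - u_1^2} = -\frac{u_1}{u_2}\,du_1
   = -\frac{u_1}{u_2}\,u_2\,dx = -u_1\,dx \quad\text{(the second equation)}; \\
&\text{Reduced system:} && du_1 = \sqrt{1 - u_1^2}\,dx
   \ \Longrightarrow\ u_1 = \sin(x + c), \quad u_2 = \cos(x + c).
\end{align*}
```

The general integral thus contains the one arbitrary constant c, as against the two constants of the differential system taken by itself.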


CHAPTER III

Linear Partial Differential Equations
Complete Systems


1. Linear operators.


In this chapter we shall frequently use N to denote the number
of independent variables, which will themselves be denoted by
the letters $z_1, z_2, \dots z_N$.

Let $f(z_1, \dots z_N)$ be any function whatever, subject only to the
condition of being differentiable to any required order. The
term linear operator relative to f will be used to denote the
operation by means of which an expression of the type

$$\sum_{\nu=1}^{N} a_\nu \frac{\partial f}{\partial z_\nu}$$

is obtained from f, the $a_\nu$'s being any functions whatever of the
z's. An expression of this kind will sometimes be denoted by a
formula of the type Af, in which it is hardly necessary to point
out that A is not a quantity, but the symbol of operation just
defined.

We have therefore

$$Af = \sum_{\nu=1}^{N} a_\nu \frac{\partial f}{\partial z_\nu}.$$


It can at once be verified that the linear operator symbol
behaves in exactly the same way as the differentiation symbol
when f is a sum, a product, or a composite function (a function
of one or more functions); i.e. for two generic functions $f_1$, $f_2$,
we have identically

$$A(f_1 + f_2) = Af_1 + Af_2, \tag{1}$$

$$A(f_1 f_2) = f_2\,Af_1 + f_1\,Af_2, \tag{2}$$

with obvious extensions to any number of terms. Further, if
f is given as a function of n arguments $v_1, v_2, \dots v_n$, which are
themselves functions of $z_1, \dots z_N$, we obviously have

$$Af(v_1, v_2, \dots v_n) = \frac{\partial f}{\partial v_1}Av_1 + \frac{\partial f}{\partial v_2}Av_2 + \dots + \frac{\partial f}{\partial v_n}Av_n. \tag{3}$$


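Now consider the result of applying successively the two linear
operators

$$Af = \sum_{\nu=1}^{N} a_\nu \frac{\partial f}{\partial z_\nu}, \qquad Bf = \sum_{\nu=1}^{N} b_\nu \frac{\partial f}{\partial z_\nu},$$

the b's, like the a's, denoting functions of z which are differentiable
to any required order.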

The second-order operators 

A(Bf), B(Af)


are thus completely defined; they may be written without danger 
of ambiguity in the form 

ABf, BAf. 


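Writing out the first of these in full, we get by successive
stages

$$ABf = \sum_{\nu=1}^{N} a_\nu \frac{\partial}{\partial z_\nu}\sum_{\rho=1}^{N} b_\rho \frac{\partial f}{\partial z_\rho} = \sum_{\nu,\rho=1}^{N} a_\nu b_\rho \frac{\partial^2 f}{\partial z_\nu\,\partial z_\rho} + \sum_{\nu,\rho=1}^{N} a_\nu \frac{\partial b_\rho}{\partial z_\nu}\frac{\partial f}{\partial z_\rho}.$$

Similarly, interchanging A and B, and therefore a and b,
we get

$$BAf = \sum_{\nu,\rho=1}^{N} b_\nu a_\rho \frac{\partial^2 f}{\partial z_\nu\,\partial z_\rho} + \sum_{\nu,\rho=1}^{N} b_\nu \frac{\partial a_\rho}{\partial z_\nu}\frac{\partial f}{\partial z_\rho}.$$

It appears from this that the two operators ABf and BAf
are not equal; the second-order terms are however the same,
as will be seen on interchanging the indices $\nu$ and $\rho$ in one of the
two double sums. It follows that the difference of the two
operators in question is a linear operator of the first order; it is called
the alternate function or Poisson's parenthesis relative to the two
operators A and B, and is denoted by the symbol of operation
(A, B), so that

$$(A, B)f = ABf - BAf = \sum_{\rho=1}^{N} (Ab_\rho - Ba_\rho)\frac{\partial f}{\partial z_\rho}. \tag{4}$$

It follows from the definition of the symbol that

$$(A, B)f = -(B, A)f. \tag{5}$$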
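
Formula (4) can be checked numerically on a small case. The following is only a sketch under stated assumptions: the two operators $A = z_2\,\partial/\partial z_1$, $B = z_1\,\partial/\partial z_2$, the test function f, the point `z0` and the helper names `partial` and `operator` are all chosen here for illustration, and derivatives are approximated by central differences.

```python
import math

def partial(g, z, k, h=1e-4):
    """Central-difference estimate of dg/dz_k at the point z."""
    zp, zm = list(z), list(z)
    zp[k] += h
    zm[k] -= h
    return (g(zp) - g(zm)) / (2 * h)

def operator(coeffs):
    """Build the map f -> Af, with Af = sum_nu a_nu * df/dz_nu."""
    def apply(g):
        return lambda z: sum(a(z) * partial(g, z, k) for k, a in enumerate(coeffs))
    return apply

# Assumed example operators: A = z2 d/dz1, B = z1 d/dz2.
a = [lambda z: z[1], lambda z: 0.0]
b = [lambda z: 0.0, lambda z: z[0]]
A, B = operator(a), operator(b)

def f(z):
    """Assumed test function."""
    return math.sin(z[0]) * math.exp(z[1])

z0 = [0.7, -0.3]

# Left-hand side of (4): ABf - BAf, applying the operators in succession.
direct = A(B(f))(z0) - B(A(f))(z0)

# Right-hand side of (4): first-order operator with coefficients A b_rho - B a_rho.
c = [(lambda z, k=k: A(b[k])(z) - B(a[k])(z)) for k in range(2)]
by_formula = operator(c)(f)(z0)

# Closed form for this example: (A, B) = -z1 d/dz1 + z2 d/dz2.
exact = (-z0[0] * math.cos(z0[0]) + z0[1] * math.sin(z0[0])) * math.exp(z0[1])

print(direct, by_formula, exact)
```

The three printed numbers should agree to within the error of the finite differences; the second-order terms cancel in `direct`, which is the content of the remark preceding (4).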


We shall now establish a formal property of linear operators, 
which we shall use farther on. 

Let there be n linear operators

$$A_k f = \sum_{\nu=1}^{N} a_{k|\nu}\frac{\partial f}{\partial z_\nu} \qquad (k = 1, 2, \dots n),$$

and let any two linear combinations of these (which will also be
linear operators),

$$Bf = \sum_{k=1}^{n} \lambda_k A_k f, \qquad Cf = \sum_{h=1}^{n} \mu_h A_h f,$$

be constructed, the $\lambda$'s and the $\mu$'s being any differentiable
functions whatever of the independent variables z.

We propose to show that the alternate function (B, C)f is
a linear combination of the operators A and of their alternate
functions. For the proof, it is sufficient to write out (B, C)f
in full; this gives

$$(B, C)f = BCf - CBf = \sum_{k=1}^{n}\lambda_k A_k(Cf) - \sum_{h=1}^{n}\mu_h A_h(Bf) = \sum_{k,h=1}^{n}\left[\lambda_k A_k(\mu_h A_h f) - \mu_h A_h(\lambda_k A_k f)\right].$$

Applying the rule for the differentiation of a product, the last
expression becomes

$$\sum_{k,h=1}^{n}\left[\lambda_k (A_k\mu_h)\,A_h f + \lambda_k\mu_h\,A_k A_h f - \mu_h (A_h\lambda_k)\,A_k f - \mu_h\lambda_k\,A_h A_k f\right],$$

so that finally

$$(B, C)f = \sum_{k,h=1}^{n}\left[(\lambda_k A_k\mu_h)\,A_h f - (\mu_h A_h\lambda_k)\,A_k f + \lambda_k\mu_h\,(A_k, A_h)f\right].$$

Q.E.D.

2. Integrals of an ordinary differential system and the partial 
differential equation which determines them. 

Consider a system of n ordinary differential equations of the
first order, in n unknowns $x_i$. Denoting the independent variable
by t, and supposing the equations solved for the derivatives of
the unknown functions, we get the equations in what is called
the normal form:

$$\frac{dx_i}{dt} = X_i(x\,|\,t) \qquad (i = 1, 2, \dots n). \tag{6}$$

Any set of n functions $x_i(t)$ which satisfies the given equations
is called a solution of the system.

The term integral of the system, on the other hand, is used
to denote any function $f(x\,|\,t)$ which reduces to a constant when
the x's are replaced by any solution of the equations (6). We
can therefore say that f is an integral if the result

$$f(x\,|\,t) = \text{constant}$$

is a necessary consequence of the differential equations (6).

We shall now show that all the functions f with this property
(and no other functions) satisfy a homogeneous linear partial
differential equation of the first order; it follows, as we shall
see farther on, that the integration of an equation of this form
can always be reduced to that of a system of the type (6).

Let $f(x\,|\,t)$ be an integral of the equations (6); then by definition,
when the x's represent functions of t which satisfy equations (6),
we have

$$f(x\,|\,t) = \text{constant},$$

and therefore, differentiating with respect to t,

$$\frac{\partial f}{\partial t} + \sum_{i=1}^{n}\frac{\partial f}{\partial x_i}\frac{dx_i}{dt} = 0,$$

or, since the functions $x_i(t)$ satisfy equations (6),

$$\frac{\partial f}{\partial t} + \sum_{i=1}^{n} X_i\frac{\partial f}{\partial x_i} = 0. \tag{7}$$

This is the partial differential equation referred to. Introducing
for shortness the linear operator

$$A = \frac{\partial}{\partial t} + \sum_{i=1}^{n} X_i\frac{\partial}{\partial x_i},$$

we can write it concisely in the form

$$Af = 0. \tag{7'}$$


Now by hypothesis equation (7), like the equation f = constant,
from which it is derived by differentiation, becomes an
identity when the x's in it are replaced by any solutions whatever
of the system (6); from this it is easy to deduce that (7) is an
identity, that is, that it holds for any values whatever (in a
suitable field) which may be assigned to the arguments x, t,
of which f is a function. In fact, given n + 1 numbers $x_1^0, \dots x_n^0, t_0$,
belonging to a field within which the general existence theorem
holds for the system (6), we know that there always exists a
solution $x_i(t)$ of the system (6), which takes the values $x_1^0, \dots x_n^0$
when $t = t_0$. Now equation (7) must hold (whatever t may be)
when this particular solution $x_i(t)$ is substituted for the x's.
In particular, putting $t = t_0$, the equation is satisfied for the
values $x_i^0$, $t_0$, arbitrarily chosen in advance. Q.E.D.

It is further evident that any function $f(x\,|\,t)$ which satisfies
equation (7), when the x's and t in it are treated as independent
variables, constitutes an integral of the system (6). In
fact, since equation (7) holds however the x's are chosen, it will
be satisfied in particular when we take a solution of the system
(6) for the x's; but when this is done the left-hand side of equation
(7) becomes identical with $\frac{df}{dt}$. The function f is therefore such
that when the x's are solutions of the system (6), $\frac{df}{dt} = 0$, or

$$f = \text{constant}.$$

To sum up, we can state that the necessary and sufficient
condition that a function $f(x\,|\,t)$ may be an integral of the system
(6) is that it should satisfy the partial differential equation (7), in
which the x's and t are n + 1 independent variables.
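
The equivalence just stated can be tried out numerically. The sketch below is an illustration under assumptions that are not in the text: the system $dx_1/dt = x_2$, $dx_2/dt = -x_1$, the candidate integral $f = x_1^2 + x_2^2$, the starting point and the step size are all chosen here, the operator A is approximated by central differences, and a solution is followed with a classical fourth-order Runge-Kutta step.

```python
def X(x, t):
    """Right-hand sides X_i(x | t) of the assumed system (6)."""
    return [x[1], -x[0]]

def f(x, t):
    """Candidate integral f = x_1^2 + x_2^2 (t does not appear explicitly)."""
    return x[0] ** 2 + x[1] ** 2

def A(f, x, t, h=1e-6):
    """Af = df/dt + sum_i X_i df/dx_i, estimated by central differences."""
    total = (f(x, t + h) - f(x, t - h)) / (2 * h)
    for i, Xi in enumerate(X(x, t)):
        xp, xm = list(x), list(x)
        xp[i] += h
        xm[i] -= h
        total += Xi * (f(xp, t) - f(xm, t)) / (2 * h)
    return total

def rk4_step(x, t, dt):
    """One classical Runge-Kutta step for dx/dt = X(x, t)."""
    k1 = X(x, t)
    k2 = X([xi + 0.5 * dt * k for xi, k in zip(x, k1)], t + 0.5 * dt)
    k3 = X([xi + 0.5 * dt * k for xi, k in zip(x, k2)], t + 0.5 * dt)
    k4 = X([xi + dt * k for xi, k in zip(x, k3)], t + dt)
    return [xi + dt * (p + 2 * q + 2 * r + s) / 6
            for xi, p, q, r, s in zip(x, k1, k2, k3, k4)]

x, t, dt = [0.3, -1.2], 0.0, 1e-3
f_start = f(x, t)
residual = A(f, x, t)          # equation (7) at an arbitrary point

for _ in range(2000):          # follow one solution from t = 0 to t = 2
    x = rk4_step(x, t, dt)
    t += dt
drift = abs(f(x, t) - f_start)

print(residual, drift)
```

Both printed quantities should be zero to within rounding and discretization error: the first expresses that f satisfies (7) identically, the second that f keeps its value along a solution of (6).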


3. Principal integrals. 

Among the integrals f of the system (6) (which, as we have
seen in the preceding section, can also be called integrals of
equation (7)), there are, for each value of t, n of special
importance which we now proceed to specify.

We take as our starting-point the most general solution of
the equations (6), which is known to be a set of n functions of t,
containing n arbitrary constants $x_i^0$:

$$x_i = \phi_i(t\,|\,x^0) \qquad (i = 1, 2, \dots n). \tag{8}$$

The constants $x_i^0$ are the values of the x's for a given value
$t_0$ of t, so that

$$(\phi_i)_{t = t_0} = x_i^0. \tag{9}$$

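We shall show first that the equations (8) are soluble with
respect to the $x^0$'s in a region round the point $t_0$. Write them in
the form

$$\phi_i(t\,|\,x^0) - x_i = 0$$

and consider the functional determinant of the left-hand side
with respect to the $x^0$'s, which is

$$D = \begin{pmatrix} \phi_1 - x_1 & \phi_2 - x_2 & \dots & \phi_n - x_n \\ x_1^0 & x_2^0 & \dots & x_n^0 \end{pmatrix},$$

or, since the $x^0$'s are contained only in the $\phi$'s, and not in the
x's,

$$D = \begin{pmatrix} \phi_1 & \phi_2 & \dots & \phi_n \\ x_1^0 & x_2^0 & \dots & x_n^0 \end{pmatrix}.$$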

Now calculate the value of this determinant for $t = t_0$.
Since the determinant itself contains no derivatives with respect
to t, we shall obtain the same result if we differentiate the functions
$\phi(t\,|\,x^0)$ with respect to the $x^0$'s, form their determinant, and
finally make $t = t_0$, as if we first make $t = t_0$ in the $\phi$'s and
then form the determinant of their derivatives. Following the
second alternative and remembering the formulae (9) we see at
once that the determinant becomes

$$D_0 = \begin{pmatrix} x_1^0 & x_2^0 & \dots & x_n^0 \\ x_1^0 & x_2^0 & \dots & x_n^0 \end{pmatrix} = 1 \neq 0.$$

Now if D, which is a continuous function of t, does not vanish
for $t = t_0$, it will have these same properties in some region
round $t_0$, and therefore within this region the equations (8) will
be soluble with respect to the $x^0$'s.

Solving the equations, we shall get

$$x_i^0 = w_i(x\,|\,t) \qquad (i = 1, 2, \dots n), \tag{10}$$

and the w's on the right-hand side constitute n integrals of the
system (6). In fact, if we replace the x's in them by any solution
of the system (6) (i.e. by a set of n functions obtained from the
equations (8) by assigning particular arbitrary values to the
constants $x^0$), then each w necessarily becomes equal to the
corresponding $x^0$, i.e. to a constant.

The integrals of equation (7) obtained in this way are called
principal integrals relative to $t = t_0$. From the definition it follows
that

$$w_i(x^0\,|\,t_0) = x_i^0.$$

Writing x instead of $x^0$, we see that a characteristic property
of the principal integrals relative to $t = t_0$ is that each of the
functions $w_i(x\,|\,t)$ reduces to the corresponding variable $x_i$ when
$t = t_0$.

Without undertaking a detailed study of the n principal
integrals, we may at least show that none of them can be expressed
as a function of the others only; i.e. that considered as n functions
of the n + 1 variables x and t, they are independent. For this
it is necessary and sufficient that the functional matrix (with
n rows and n + 1 columns) of the w's with respect to the x's
and t shall have n for its characteristic; i.e. that the matrix shall
contain a determinant of order n which is not zero. Now if we
take the determinant

$$\begin{pmatrix} w_1 & w_2 & \dots & w_n \\ x_1 & x_2 & \dots & x_n \end{pmatrix} \tag{11}$$

and apply to it the same considerations as we have already used
for D, we find that it = 1 for $t = t_0$ (since then $w_i = x_i$) and
therefore there is a region round $t_0$ in which it is not zero; hence
the characteristic of the matrix is n and the principal integrals
are independent.
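
A short worked case may make the construction concrete. It is an illustration only: the system $dx_1/dt = x_2$, $dx_2/dt = -x_1$ is assumed here and does not occur in the text. Writing $\tau = t - t_0$, the general solution, the principal integrals obtained by solving for the initial values, and the verification that they satisfy (7) and are independent run as follows.

```latex
% Principal integrals of an assumed example system: dx_1/dt = x_2, dx_2/dt = -x_1.
\begin{align*}
&x_1 = x_1^0\cos\tau + x_2^0\sin\tau, \qquad
 x_2 = -x_1^0\sin\tau + x_2^0\cos\tau, \qquad \tau = t - t_0; \\
&w_1 = x_1\cos\tau - x_2\sin\tau, \qquad
 w_2 = x_1\sin\tau + x_2\cos\tau; \\
&Aw_1 = \frac{\partial w_1}{\partial t} + x_2\frac{\partial w_1}{\partial x_1}
        - x_1\frac{\partial w_1}{\partial x_2}
      = (-x_1\sin\tau - x_2\cos\tau) + x_2\cos\tau + x_1\sin\tau = 0, \\
&Aw_2 = (x_1\cos\tau - x_2\sin\tau) + x_2\sin\tau - x_1\cos\tau = 0; \\
&\begin{vmatrix}
   \partial w_1/\partial x_1 & \partial w_1/\partial x_2 \\
   \partial w_2/\partial x_1 & \partial w_2/\partial x_2
 \end{vmatrix}
 = \begin{vmatrix} \cos\tau & -\sin\tau \\ \sin\tau & \cos\tau \end{vmatrix} = 1 .
\end{align*}
```

For $t = t_0$ the integrals reduce to $w_1 = x_1$, $w_2 = x_2$, which is the characteristic property of principal integrals; in this particular case the determinant is equal to 1 for every t, not only in a region round $t_0$.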

4. Independent integrals. General integral.

More generally, n integrals $v_1, v_2, \dots v_n$ of equation (7) are
said to be independent if the functions $v_i(x\,|\,t)$ (i = 1, 2, . . . n)
are independent. Of course every function

$$F(v_1, v_2, \dots v_n) \tag{12}$$

of the v's only is also an integral, as follows immediately from
formula (3), remembering that for every $v_i$, with the operator
A as defined by (7'), we have

$$Av_i = 0.$$

But the reciprocal theorem is also true, and every integral
of equation (7) can be put in the form (12), which therefore
represents the general integral of equation (7).

To prove this, let f be an integral of equation (7); then the
n + 1 equations

$$Af = \frac{\partial f}{\partial t} + \sum_{j=1}^{n} X_j\frac{\partial f}{\partial x_j} = 0,$$

$$Av_i = \frac{\partial v_i}{\partial t} + \sum_{j=1}^{n} X_j\frac{\partial v_i}{\partial x_j} = 0 \qquad (i = 1, 2, \dots n),$$

linear and homogeneous in the n + 1 quantities $1, X_1, X_2, \dots X_n$,
which do not all vanish, will be satisfied. The determinant of
their coefficients must therefore vanish, i.e.

$$\begin{pmatrix} f & v_1 & \dots & v_n \\ t & x_1 & \dots & x_n \end{pmatrix} = 0.$$

This means that $f, v_1, v_2, \dots v_n$ are not independent. As the
v's are independent, one of the determinants of order n of the
functional matrix relative to $v_1, v_2, \dots v_n$ is certainly not zero.
But this is the case considered in Chapter I, pp. 5-8; hence we
conclude that f can be expressed in terms of the v's only, without
involving t or the x's.


5. Direct study of the most general linear homogeneous partial
differential equation.

As a consequence of the relations which we have shown to
hold between linear homogeneous partial differential equations
of the first order, and systems of ordinary differential equations
of the first order, we can in every case reduce the integration of
an equation of the former kind to the integration of a system of
the latter. In fact, equation (7) differs from the most general
possible equation only in having one of its coefficients equal to
1; but it will be at once obvious that every linear homogeneous
equation of the first order can be reduced to this form, an
elementary remark which we shall examine in more detail.
Consider the equation

$$Af = \sum_{\nu=1}^{N} a_\nu\frac{\partial f}{\partial z_\nu} = 0 \tag{13}$$

in N independent variables $z_1, z_2, \dots z_N$.

At least one of the a's, say $a_N$, will be different from zero.
We may therefore divide the equation by $a_N$. As a result of this
step, $z_N$ is in what we may call a privileged position (the
coefficient of $\frac{\partial f}{\partial z_N}$ being reduced to unity); it is therefore natural
to denote it by a special symbol. Calling it t, and introducing the
symmetrical notation $x_1, x_2, \dots x_n$ (n = N - 1) for the remaining
variables z, equation (13) becomes

$$\frac{\partial f}{\partial t} + \sum_{i=1}^{n}\frac{a_i}{a_N}\frac{\partial f}{\partial x_i} = 0,$$

which will coincide term for term with equation (7) if we put

$$\frac{a_i}{a_N} = X_i \qquad (i = 1, 2, \dots n).$$

The corresponding system of equations (6) will thus be

$$\frac{dx_i}{dt} = \frac{a_i}{a_N} \qquad (i = 1, 2, \dots n). \tag{14}$$

Integrating these n equations, in which t is considered as the
independent variable, we get the functions $\phi_i(t\,|\,x^0)$, which reduce
to $x_i^0$ when $t = t_0$; solving for each of the $x^0$'s, the resulting
expressions

$$w_i(x\,|\,t) = w_i(z)$$

will be the n principal integrals of equation (13) relative to $t = t_0$,
and we may take

$$F(w_1, \dots w_n)$$

as the form of the general integral, where the symbol F denotes
an arbitrary function.

It will frequently be quicker not to find the principal integrals
w, but to obtain n independent integrals $v_1, v_2, \dots v_n$ of the
equations (14) in any way whatever. The general integral is then at
once expressible as an arbitrary function of the v's only, in the
form

$$F(v_1, v_2, \dots v_n).$$


We have seen that we may choose as independent variable
(for the system of equations (14)) any one of the variables z
which contributes an actual term to equation (13) (the
corresponding coefficient a not being zero). The choice may be
determined by reasons of convenience for the particular case concerned.
In order to avoid prejudging the case before the necessary reasons
for our choice appear, we may write the equations (14) in the
form

$$\frac{dx_i}{a_i} = \frac{dt}{a_N} \qquad (i = 1, 2, \dots n),$$

or, returning to the original notation,

$$\frac{dz_1}{a_1} = \frac{dz_2}{a_2} = \dots = \frac{dz_N}{a_N}. \tag{15}$$

It will be seen that these can be at once written down from
the given partial differential equation; it should be noted that
if any of the a's is zero, the differential of the corresponding
variable must also be equated to zero.

Examples:

(1) Take the equation in three variables

$$x\frac{\partial f}{\partial x} + y\frac{\partial f}{\partial y} + z\frac{\partial f}{\partial z} = 0. \tag{16}$$

The corresponding system of two ordinary differential
equations is

$$\frac{dx}{x} = \frac{dy}{y} = \frac{dz}{z},$$

i.e. $d\log x = d\log y = d\log z$.

Writing these in the form

$$d\log y - d\log x = 0, \qquad d\log z - d\log x = 0,$$

which is the same as

$$d\log\frac{y}{x} = 0, \qquad d\log\frac{z}{x} = 0,$$

we get

$$\frac{y}{x} = c_1, \qquad \frac{z}{x} = c_2 \qquad (c_1, c_2 \text{ constants}).$$

Hence two independent integrals are

$$\frac{y}{x}, \qquad \frac{z}{x},$$

and therefore the general integral will be

$$F\left(\frac{y}{x}, \frac{z}{x}\right).$$

This is merely the most general homogeneous function of
degree zero. In fact, if $\phi(x, y, z)$ denotes the latter, then by
definition we must have, for any value of $\lambda$ whatever,

$$\phi(\lambda x, \lambda y, \lambda z) = \phi(x, y, z);$$

and therefore, putting $\lambda = \frac{1}{x}$,

$$\phi\left(1, \frac{y}{x}, \frac{z}{x}\right) = \phi(x, y, z);$$

$\phi$ is therefore in effect an arbitrary function F of $\frac{y}{x}$, $\frac{z}{x}$.

We have thus found for a particular case (which can obviously
be generalized) Euler's well-known theorem on homogeneous
functions.
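
The result of example (1) lends itself to a quick numerical check. The following sketch is illustrative only: the particular function `F`, the sample point and the scaling factor are assumptions made here, and the partial derivatives are estimated by central differences.

```python
import math

def F(p, q):
    """An arbitrarily chosen smooth function of two arguments (assumed)."""
    return math.sin(p) + p * q ** 2

def f(x, y, z):
    """General integral of (16): an arbitrary function of y/x and z/x."""
    return F(y / x, z / x)

def euler_operator(f, x, y, z, h=1e-6):
    """x f_x + y f_y + z f_z, with derivatives by central differences."""
    fx = (f(x + h, y, z) - f(x - h, y, z)) / (2 * h)
    fy = (f(x, y + h, z) - f(x, y - h, z)) / (2 * h)
    fz = (f(x, y, z + h) - f(x, y, z - h)) / (2 * h)
    return x * fx + y * fy + z * fz

point = (1.3, -0.4, 2.1)
residual = euler_operator(f, *point)

# Homogeneity of degree zero: f is unchanged when x, y, z are all multiplied by 3.
scaled_difference = f(*(3.0 * c for c in point)) - f(*point)

print(residual, scaled_difference)
```

Both quantities should vanish up to rounding: the first is the left-hand side of (16), the second expresses that f is homogeneous of degree zero.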

(2) Take the equation

$$\begin{vmatrix} \dfrac{\partial f}{\partial x} & \dfrac{\partial f}{\partial y} & \dfrac{\partial f}{\partial z} \\ x & y & z \\ a & b & c \end{vmatrix} = 0,$$

where a, b, c are constants which are not all zero. Putting

$$X = \begin{vmatrix} y & z \\ b & c \end{vmatrix}, \qquad Y = \begin{vmatrix} z & x \\ c & a \end{vmatrix}, \qquad Z = \begin{vmatrix} x & y \\ a & b \end{vmatrix},$$

the equation may be written in the form

$$X\frac{\partial f}{\partial x} + Y\frac{\partial f}{\partial y} + Z\frac{\partial f}{\partial z} = 0,$$

and the corresponding system of ordinary differential equations
is

$$\frac{dx}{X} = \frac{dy}{Y} = \frac{dz}{Z}. \tag{17}$$

Taking x as independent variable, we shall have to integrate
the two equations

$$\frac{dy}{dx} = \frac{Y}{X}, \qquad \frac{dz}{dx} = \frac{Z}{X}.$$

But we can find two independent integrals more easily by
another method. In fact, from the form of the equations defining
X, Y, Z, we see that the equations

$$xX + yY + zZ = 0, \qquad aX + bY + cZ = 0 \tag{18}$$

are satisfied, since the given determinant vanishes if the elements
of the first row are replaced by x, y, z, or by a, b, c; expanding,
we find that the equations (18) are identically true.

Now if dx, dy, dz satisfy equations (17), i.e. are proportional
to X, Y, Z, we can substitute them for X, Y, Z in equations
(18), so getting

$$x\,dx + y\,dy + z\,dz = 0, \qquad a\,dx + b\,dy + c\,dz = 0.$$

The left-hand side of each of these equations is an exact
differential; hence, integrating, we get as a necessary consequence
of equations (17)

$$x^2 + y^2 + z^2 = c_1, \qquad ax + by + cz = c_2.$$

These are two particular integrals of the system, which are
certainly independent, since at least one of the coefficients a, b, c
is not zero. The general integral is thus

$$F(x^2 + y^2 + z^2,\ ax + by + cz).$$
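
Example (2) can be checked in the same numerical spirit. In the sketch below the values of a, b, c, the function `F` and the sample point are assumptions made for illustration; the code forms X, Y, Z from their defining determinants, confirms the identities (18), and evaluates the left-hand side of the equation on a function of the stated form.

```python
import math

a, b, c = 2.0, -1.0, 0.5   # assumed constants, not all zero

def F(p, q):
    """An arbitrarily chosen smooth function of two arguments (assumed)."""
    return math.exp(-p) * math.cos(q) + q ** 3

def f(x, y, z):
    """Candidate general integral F(x^2 + y^2 + z^2, ax + by + cz)."""
    return F(x * x + y * y + z * z, a * x + b * y + c * z)

def minors(x, y, z):
    """X, Y, Z as the two-rowed determinants defined in the text."""
    return (y * c - z * b, z * a - x * c, x * b - y * a)

def lhs(x, y, z, h=1e-6):
    """X f_x + Y f_y + Z f_z, with derivatives by central differences."""
    fx = (f(x + h, y, z) - f(x - h, y, z)) / (2 * h)
    fy = (f(x, y + h, z) - f(x, y - h, z)) / (2 * h)
    fz = (f(x, y, z + h) - f(x, y, z - h)) / (2 * h)
    X, Y, Z = minors(x, y, z)
    return X * fx + Y * fy + Z * fz

point = (0.4, -0.7, 1.1)
X, Y, Z = minors(*point)
identity_1 = point[0] * X + point[1] * Y + point[2] * Z   # first of (18)
identity_2 = a * X + b * Y + c * Z                         # second of (18)
residual = lhs(*point)

print(identity_1, identity_2, residual)
```

All three printed numbers should be zero up to rounding and finite-difference error.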


Geometrical interpretation. When there are three (or two)
variables, the foregoing discussion can be given a geometrical
interpretation in ordinary space (or in a plane). For this purpose,
let any integral f(x, y, z) of an equation

$$X\frac{\partial f}{\partial x} + Y\frac{\partial f}{\partial y} + Z\frac{\partial f}{\partial z} = 0 \tag{a}$$

be considered as the parameter of a family of surfaces f =
constant. By a suitable choice of the constant on the right-hand
side of this equation, we can make one surface of the family
pass through a point P arbitrarily chosen in advance; it is plainly
only necessary to give this constant the value of f at the chosen
point P. The equation (a) which f must satisfy expresses the
geometrical fact that at any point P the normal to that surface
of the family which passes through P is normal also to the
direction of the vector (X, Y, Z), given as a function of position.
The system of equations associated with equation (a), namely

$$\frac{dx}{X} = \frac{dy}{Y} = \frac{dz}{Z}, \tag{b}$$

which expresses the relation of two of the variables to the third,
represents on the other hand a property of certain curves. We
know from the existence theorem that we can find (and in only
one way) two functions x = x(z), y = y(z) which satisfy the
system of equations (b), and which take values $x_0$, $y_0$, arbitrarily
fixed in advance, when z has the value $z_0$, which is also arbitrary;
hence we can state that, given any point P, there passes through
it one and only one curve which has the property expressed by
the equations (b), i.e. that of being at every point in the direction
of the vector (X, Y, Z). An aggregate of curves such that one
and only one of them passes through every point of a given field
is called a congruence.

There is a very simple relation between the family of surfaces
which represent the integrals of equation (a) and the congruence
of curves which represent the solutions of the system of equations
(b), namely, that each curve of the congruence lies wholly on a surface
of the family. In fact, consider a point P(x, y, z), and let L be
the curve of the congruence, and S the surface of the family
considered, which pass respectively through P. We shall show
that a point which undergoes an infinitely small displacement
along L, from P to a point P', does not leave the surface, or in
other words that if the equation of the surface S, f(x, y, z) = C,
is satisfied by the co-ordinates x, y, z of P, it is also satisfied by
the co-ordinates x + dx, y + dy, z + dz of P'. The result is
obvious, since for the co-ordinates of P' f becomes

$$f(x, y, z) + \frac{\partial f}{\partial x}dx + \frac{\partial f}{\partial y}dy + \frac{\partial f}{\partial z}dz,$$

and as dx, dy, dz are by equations (b) proportional to X, Y, Z,
the increment of f is proportional to

$$X\frac{\partial f}{\partial x} + Y\frac{\partial f}{\partial y} + Z\frac{\partial f}{\partial z},$$

which vanishes by equation (a).
The surfaces considered are therefore formed by curves of
the congruence; these curves are called their characteristics.

We can of course avoid the use of infinitesimal displacements
in proving the property that every curve of the congruence lies
wholly on a surface f = constant. The argument, which is
essentially the same as before, will be as follows. Let the
variables x, y, z in the expression f(x, y, z) be considered as the
co-ordinates of points of a curve L of the congruence, and let $\lambda$
be any parameter, e.g. the arc of the curve, which fixes definitely
the position of a point on L; x, y, z are thus considered as definite
functions of $\lambda$. Substituting these functions of $\lambda$ for x, y, z in
the expression f(x, y, z), we shall see that the result is independent
of $\lambda$, or that $\frac{df}{d\lambda} = 0$. We have in fact

$$\frac{df}{d\lambda} = \frac{\partial f}{\partial x}\frac{dx}{d\lambda} + \frac{\partial f}{\partial y}\frac{dy}{d\lambda} + \frac{\partial f}{\partial z}\frac{dz}{d\lambda};$$

but by equations (b), along L, $\frac{dx}{d\lambda}$, $\frac{dy}{d\lambda}$, $\frac{dz}{d\lambda}$ are proportional to
X, Y, Z, and f satisfies equation (a); hence $\frac{df}{d\lambda} = 0$. The fact
that f remains constant along L is the algebraical equivalent of
the geometrical property that the curve L belongs to a surface
f = constant.

6. Integrals of a total differential system, and the associated 
system of partial differential equations which determines them. 

Starting from a system of ordinary differential equations,
we have succeeded in integrating the most general linear
homogeneous partial differential equation of the first order. By an
analogous procedure, starting from a system of total differential
equations, we shall succeed in integrating the most general
system of linear homogeneous partial differential equations of the
first order.

Consider the system of equations

$$du_\alpha = \sum_{i=1}^{n} X_{\alpha|i}(x\,|\,u)\,dx_i \qquad (\alpha = 1, 2, \dots m). \tag{19}$$

We shall apply the term integral of this system to every
function $f(x\,|\,u)$ which is such that it reduces to a constant when
the u's are replaced by any solution of the equations (19).
Differentiating f we get

$$df = \sum_{i=1}^{n}\frac{\partial f}{\partial x_i}dx_i + \sum_{\alpha=1}^{m}\frac{\partial f}{\partial u_\alpha}du_\alpha,$$

and, if the u's are solutions of the equations (19),

$$df = \sum_{i=1}^{n}\left(\frac{\partial f}{\partial x_i} + \sum_{\alpha=1}^{m} X_{\alpha|i}\frac{\partial f}{\partial u_\alpha}\right)dx_i.$$

The necessary and sufficient condition for the vanishing of
this differential, whatever the dx's may be, is evidently that the
n equations

$$\frac{\partial f}{\partial x_i} + \sum_{\alpha=1}^{m} X_{\alpha|i}\frac{\partial f}{\partial u_\alpha} = 0 \qquad (i = 1, 2, \dots n) \tag{20}$$

shall be satisfied. They must be satisfied not only when the
u's are solutions of the equations (19), as is clear from what has
already been said, but also identically, which can be seen in the
same way as for the corresponding result in § 2 (p. 36).

Introducing the linear operators

$$\Omega_i = \sum_{\alpha=1}^{m} X_{\alpha|i}\frac{\partial}{\partial u_\alpha}, \tag{21}$$

$$B_i = \frac{\partial}{\partial x_i} + \Omega_i \qquad (i = 1, 2, \dots n), \tag{22}$$

the equations (20) may be written in the form

$$B_i f = 0 \qquad (i = 1, 2, \dots n). \tag{20'}$$

The system of equations (20) or (20') is said to be associated
with the system of equations (19); the necessary and sufficient
condition that f may be an integral of the system of equations
(19) is that it should satisfy the associated system of partial
differential equations (20), or, in other words, that it should be
an integral of the associated system.
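
A minimal worked case, assumed here for illustration and not taken from the text, shows the associated system at work. Take n = 2, m = 1 and the single total differential equation $du = u\,dx_1 + u\,dx_2$, so that both coefficients of the system are equal to u.

```latex
% Assumed example: one total differential equation in two independent variables.
\begin{align*}
&du = u\,dx_1 + u\,dx_2; \\
&\text{integrability: } \frac{\partial u}{\partial x_2} + \frac{\partial u}{\partial u}\cdot u
   = u = \frac{\partial u}{\partial x_1} + \frac{\partial u}{\partial u}\cdot u
   \quad (\text{coefficients regarded as functions of } x_1, x_2, u); \\
&\text{solutions: } u = c\,e^{x_1 + x_2}; \\
&\text{associated system: } B_i f = \frac{\partial f}{\partial x_i} + u\frac{\partial f}{\partial u} = 0
   \qquad (i = 1, 2); \\
&f = u\,e^{-x_1 - x_2}: \qquad
  B_i f = -u\,e^{-x_1 - x_2} + u\,e^{-x_1 - x_2} = 0 .
\end{align*}
```

The function $f = u\,e^{-x_1 - x_2}$ reduces to the constant c on every solution and satisfies both equations of the associated system identically, in agreement with the general statement.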

7. Principal integrals, as typical cases of independent integrals.

Suppose that the system of equations (19) is completely
integrable, and let us assign fixed arbitrary values to the
constants $x_i^0$ and $u_\alpha^0$ in the field in which the $X_{\alpha|i}$ are regular; then
we know that there exist m functions,

$$u_\alpha = \phi_\alpha(x\,|\,u^0) \qquad (\alpha = 1, 2, \dots m),$$

regular in the region round the values $x^0$, which satisfy the
equations (19), and which become respectively equal to the
assigned constants $u_\alpha^0$ when $x_i = x_i^0$. The equations so written
are soluble with respect to the quantities $u^0$ in a region round
the point $x^0$, as can be shown by means of the same arguments
as those of § 3 (p. 38). Suppose them solved; we can then express
the $u^0$'s in terms of the x's and the u's, and we shall write

$$w_\alpha(x\,|\,u) = u_\alpha^0.$$

The w's are evidently integrals of the system of total differential
equations (19), and are therefore also integrals of the system of
partial differential equations (20); we shall now show that they
are independent.

Consider the functional matrix of the w's with respect to the
x's and the u's; we shall have to show that its characteristic is
m, or, which comes to the same thing (since it contains no
determinants of order > m), that it contains a determinant of order
m which is not zero. Now the determinant

$$\begin{pmatrix} w_1 & w_2 & \dots & w_m \\ u_1 & u_2 & \dots & u_m \end{pmatrix}$$

becomes = 1 when $x_i = x_i^0$, and therefore is different from
zero in a region round that point; hence the required result
follows.

The m independent integrals w are called principal integrals
of the total differential system (19), or of the partial differential
system (20), corresponding to the values $x^0$ of the independent
variables x. We have thus shown that, with the hypothesis that
the system of equations (19) is completely integrable, the system
of equations (20) (or (20')) admits of m independent integrals,
which can be determined in an infinite number of ways; namely,
the principal integrals just considered, which in general vary
with the choice of the initial values $x^0$.

Here too, as on p. 40, we shall say more generally that the
m integrals $v_1, v_2, \dots v_m$ of the system (20) are independent
if the functions $v(x\,|\,u)$ of the n + m variables x and u are
independent.

8. The general integral. 

By a property already noted of linear operators, if we
construct any function whatever of the v's,

$$F(v_1, v_2, \dots v_m), \tag{23}$$

we get a new integral of the system of partial differential equations
(20). In addition, for the system of equations (20), as before for
the single equation (7), the most general function which satisfies
the system is included in the expression (23); or this expression,
when F is considered as an arbitrary function, constitutes the
general integral of the system.

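To prove this, let $f(x\,|\,u)$ denote any integral of the system
(20), and consider the functional matrix of the m + 1 functions
(of m + n variables) $v_1, \dots v_m, f$:

$$M = \begin{Vmatrix}
\dfrac{\partial v_1}{\partial x_1} & \dots & \dfrac{\partial v_1}{\partial x_n} & \dfrac{\partial v_1}{\partial u_1} & \dots & \dfrac{\partial v_1}{\partial u_m} \\
\vdots & & \vdots & \vdots & & \vdots \\
\dfrac{\partial v_m}{\partial x_1} & \dots & \dfrac{\partial v_m}{\partial x_n} & \dfrac{\partial v_m}{\partial u_1} & \dots & \dfrac{\partial v_m}{\partial u_m} \\
\dfrac{\partial f}{\partial x_1} & \dots & \dfrac{\partial f}{\partial x_n} & \dfrac{\partial f}{\partial u_1} & \dots & \dfrac{\partial f}{\partial u_m}
\end{Vmatrix}.$$

If we can show that the characteristic of this matrix is m,
it will follow that there exists (m + 1) - m, or 1, relation between
the m + 1 functions which does not involve the x's or the u's
(cf. § 7, pp. 9-12). This relation must necessarily contain f
explicitly, since there can be no relation connecting the v's alone.
We can therefore solve it for f, which will have the form (23),
so giving the required result.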

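We shall first make a slight change in the form of the matrix
M, by making it contain only derivatives with respect to the
u's. This is easily done, for since

$$B_i v_\alpha = 0, \qquad B_i f = 0,$$

we get from formula (22)

$$\frac{\partial v_\alpha}{\partial x_i} = -\Omega_i v_\alpha, \qquad \frac{\partial f}{\partial x_i} = -\Omega_i f.$$

The matrix thus becomes

$$M = \begin{Vmatrix}
-\Omega_1 v_1 & \dots & -\Omega_n v_1 & \dfrac{\partial v_1}{\partial u_1} & \dots & \dfrac{\partial v_1}{\partial u_m} \\
\vdots & & \vdots & \vdots & & \vdots \\
-\Omega_1 v_m & \dots & -\Omega_n v_m & \dfrac{\partial v_m}{\partial u_1} & \dots & \dfrac{\partial v_m}{\partial u_m} \\
-\Omega_1 f & \dots & -\Omega_n f & \dfrac{\partial f}{\partial u_1} & \dots & \dfrac{\partial f}{\partial u_m}
\end{Vmatrix}.$$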

To prove that the characteristic is m, we have to prove:

(1) That every determinant of order m + 1 (the highest
order possible) vanishes;

(2) That at least one determinant of order m does not vanish.

A generic determinant of order m + 1 will be formed by
taking all the m + 1 rows, and m + 1 columns chosen arbitrarily
from the m + n of the matrix. These m + n columns are of two
types: the first n contain the operators $\Omega$, the remaining m the
operators $\frac{\partial}{\partial u}$; let r columns be taken of the first type, and s of
the second, with of course r + s = m + 1. Now in order to
write down in a perfectly general form a row of this determinant,
which will contain either the v's or (if it is the last row) f, we shall
use the symbol $\phi$ to denote either one of the v's or f; we can then
write the row as follows:

$$-\Omega_{i_1}\phi,\ \ -\Omega_{i_2}\phi,\ \ \dots\ \ -\Omega_{i_r}\phi,\ \ \frac{\partial\phi}{\partial u_{k_1}},\ \ \frac{\partial\phi}{\partial u_{k_2}},\ \ \dots\ \ \frac{\partial\phi}{\partial u_{k_s}},$$

where the suffixes $i_1, i_2, \dots i_r$ constitute any arrangement of
r numbers, chosen from 1 to n, and the suffixes $k_1, k_2, \dots k_s$ any
arrangement of s numbers, chosen from 1 to m. Remembering
the definition of $\Omega$ given in (21), we see that each of the first r
elements of the general row is a linear combination of the other
elements; or, as we usually say, that the first r columns of the
determinant are linear combinations of the other columns. The
determinant can thus be broken up into a linear combination of
determinants of order m + 1 in which all the columns are of the
second type (i.e. are composed of terms $\frac{\partial\phi}{\partial u}$). But there are in
all only m columns of the second type; it is therefore impossible
to choose m + 1 of them without repeating at least one. It
follows that in each of these partial determinants there are at
least two columns equal, and therefore these determinants all
vanish. The general determinant of order m + 1, which is a
linear combination of them, must therefore also vanish. This
proves the first of the required propositions.

The existence of a non-vanishing determinant of order m 
is a direct consequence of the hypothesis that the integrals v 
are independent. 

We have therefore proved completely that the characteristic
of M is m, and therefore that f can be expressed in terms of the
independent integrals v, i.e. that f has the form given in (23).


9. Direct study of the most general system of linear homo- 
geneous partial differential equations of the first order. Complete 
systems. Jacobian systems. 

Let us consider a generic system of n linear homogeneous 
partial differential equations of the first order, in N variables, 
and with only one unknown function: 

$$A_h f \equiv \sum_{\nu=1}^{N} a_{h\nu}\,\frac{\partial f}{\partial z_\nu} = 0 \qquad (h = 1, 2, \ldots, n). \tag{24}$$

We shall suppose that these n equations are independent,
and we can therefore assume n < N. In fact, if n > N, the
equations, which we have supposed independent, considered as
algebraic equations in the N quantities $\partial f/\partial z_\nu$, would be mutually
inconsistent; and if n = N, this would imply that $\partial f/\partial z_\nu = 0$,
or f = constant. Further, it is clear that every f which satisfies
equations (24) must necessarily also satisfy the following $\tfrac{1}{2}n(n-1)$
equations (obtained by constructing all possible Poisson's paren-
theses with the given operators):

$$(A_h, A_k) f = 0 \qquad (h, k = 1, 2, \ldots, n). \tag{25}$$

These are differential consequences of the given system. Since
derivatives of the second order disappear from equations (25),
it may happen that these equations, or some of them, are also
algebraic consequences of the system, i.e. that they can be obtained
algebraically by taking a linear combination of the n given
equations.

If all the equations (25) are algebraic consequences of the 
system of equations (24), this system is called complete. 

In the opposite case, consider the system formed by adding 
to (24) those of (25) which, together with (24), are linearly inde- 
pendent. The new system will be equivalent to the original one, 
and will contain one or more additional equations. Repeating
the same procedure for the new system, and so on, we shall reach
either a complete system or else a system in which the number
of equations is equal to or greater than N, the case of f = constant
or of mutual inconsistency already noted at the beginning of this
section.

We need therefore only consider complete systems. The 
condition of completeness can be written in the following form: 

$$(A_h, A_k) f = \sum_{l=1}^{n} p_{hkl}\, A_l f \qquad (h, k = 1, 2, \ldots, n), \tag{26}$$

where the coefficients p denote functions (a priori of any form
whatever) of the independent variables z. From the definition,
and applying identity (5), it follows that the coefficients p satisfy
the relations

$$p_{hkl} = -p_{khl} \qquad (h, k, l = 1, 2, \ldots, n).$$

A particular case — of special importance — of a complete 
system is that in which all Poisson’s parentheses are identically 
zero (i.e. all the coefficients p are zero); when this is so the system 
is called Jacobian. 
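
As a small illustration of these definitions, the following worked sketch uses operators chosen here purely for convenience (they do not appear in the text), with N = 3 variables $z_1, z_2, z_3$ and n = 2 equations. It follows the convention (A, B)f = ABf - BAf.

```latex
% Hypothetical example, not from the text: a system that is not complete.
\begin{align*}
A_1 f &= \frac{\partial f}{\partial z_1} = 0, \qquad
A_2 f = \frac{\partial f}{\partial z_2} + z_1 \frac{\partial f}{\partial z_3} = 0, \\
(A_1, A_2) f &= A_1 A_2 f - A_2 A_1 f = \frac{\partial f}{\partial z_3}.
\end{align*}
% The parenthesis is not a linear combination of A_1 f and A_2 f, so the
% system is not complete.  Adjoining \partial f / \partial z_3 = 0 gives
% three independent equations in N = 3 variables, whence f = constant.
%
% By contrast, the pair
\begin{equation*}
A_1 f = \frac{\partial f}{\partial z_1} = 0, \qquad
A_2 f = \frac{\partial f}{\partial z_2} = 0
\end{equation*}
% has (A_1, A_2) f = 0 identically: it is Jacobian, and its general
% integral is an arbitrary function F(z_3).
```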

10. Equivalence of every complete system to a Jacobian system
with the same number of equations. Note on Cramer’s rule. 

We propose to show that a complete system can always be 
replaced by a Jacobian system with the same number of equations; 
thus the consideration of any complete system can be reduced 
to that of a Jacobian system. 

Starting from the system of equations (24), we shall suppose
that it is complete, i.e. that the equations (26) are satisfied. We
shall adopt the following procedure: we shall construct n distinct
linear combinations of the n given equations,

$$B_i f \equiv \sum_{k=1}^{n} c_{ik}\, A_k f = 0 \qquad (i = 1, 2, \ldots, n), \tag{27}$$

with the condition

$$\| c_{ik} \| \neq 0 \qquad (i, k = 1, 2, \ldots, n),$$

and we shall choose the coefficients c in such a way that the
system (27), which is equivalent to the given system, may be
Jacobian.

Before doing this, however, we shall write the given equations
in a slightly different form. We know that the matrix of the
a's has its characteristic equal to n (since the equations are inde-
pendent); let us arrange the variables in such an order that the
determinant a formed by taking the first n columns of the matrix
may be that which does not vanish (or one of those which do
not):

$$a = \begin{vmatrix}
a_{11} & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} & \cdots & a_{2n} \\
\vdots & \vdots & & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn}
\end{vmatrix} \neq 0.$$

We shall next divide the variables z into two groups: we shall
call the first n of them $x_1, x_2, \ldots, x_n$, and the remaining N - n
= m we shall call $u_1, u_2, \ldots, u_m$. With this notation, the given
system can be written in the form

$$\sum_{k=1}^{n} a_{hk}\,\frac{\partial f}{\partial x_k} + U_h f = 0 \qquad (h = 1, 2, \ldots, n),$$


where $U_h$ denotes an operator involving only derivatives
with respect to the u's, the explicit expression of which does
not for the moment concern us. Now solve¹ these n equations
with respect to the terms $\partial f/\partial x_i$. Putting them in the form

$$\sum_{i=1}^{n} a_{ki}\,\frac{\partial f}{\partial x_i} = -U_k f \qquad (k = 1, 2, \ldots, n),$$

multiply each equation by the corresponding $a^{ki}$ (the reciprocal
element of $a_{ki}$ in the determinant of the a's) and sum with respect
to k from 1 to n. We thus get n linear combinations:

$$\frac{\partial f}{\partial x_i} = -\sum_{k=1}^{n} a^{ki}\, U_k f \qquad (i = 1, 2, \ldots, n),$$

which are independent, since by a well-known result the deter-
minant of the coefficients $a^{ki}$ is equal to 1/a, and therefore is not
zero. These equations can be written in the more concise form

$$\frac{\partial f}{\partial x_i} + \Omega_i f = 0 \qquad (i = 1, 2, \ldots, n), \tag{24'}$$

where the $\Omega_i$'s represent linear operators containing, like the U's,
only derivatives with respect to the u's, and therefore of the
form

$$\Omega_i = \sum_{\alpha=1}^{m} X_{\alpha|i}\,\frac{\partial}{\partial u_\alpha}.$$

¹ Cramer's well-known rule may be put in the following form, which we shall
frequently use, here and elsewhere.

Let there be given n linear equations

$$\sum_{\nu=1}^{n} a_{k\nu}\,\xi_\nu = f_k \qquad (k = 1, 2, \ldots, n), \tag{$\alpha$}$$

such that the determinant a of their coefficients is not zero.

We shall denote by $a^{rs}$ the reciprocal element of the generic element $a_{rs}$ of the
determinant a; i.e. the algebraic complement (or minor) of $a_{rs}$ divided by a.
Then, applying two ordinary theorems on determinants, and indicating by $\delta_r^s$ either
zero or unity, according as r ≠ s or r = s, we get

$$\sum_{k=1}^{n} a_{rk}\, a^{sk} = \delta_r^s, \tag{$\beta$}$$

$$\sum_{k=1}^{n} a_{kr}\, a^{ks} = \delta_r^s. \tag{$\beta'$}$$

Applying these properties, the equations ($\alpha$) can be solved by constructing
suitable linear combinations of them. For instance, to find $\xi_i$, multiply the kth
equation by $a^{ki}$; then giving k all values from 1 to n, and summing, we get

$$\sum_{k=1}^{n} a^{ki} \sum_{\nu=1}^{n} a_{k\nu}\,\xi_\nu = \sum_{k=1}^{n} a^{ki} f_k.$$

The left-hand side of this equation can be transformed as follows:

$$\sum_{\nu=1}^{n} \xi_\nu \sum_{k=1}^{n} a_{k\nu}\, a^{ki} = \sum_{\nu=1}^{n} \xi_\nu\, \delta_\nu^i = \xi_i;$$

hence the solution is given by the formula:

$$\xi_i = \sum_{k=1}^{n} a^{ki} f_k.$$
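
The form of Cramer's rule used here (reciprocal elements, i.e. cofactors divided by the determinant) can be tried on a small numerical case. The sketch below is only an illustration: the function names and the 3 x 3 system are invented for the purpose, and exact rational arithmetic is used so that the identities hold without rounding.

```python
from fractions import Fraction


def det(m):
    """Determinant by Laplace expansion along the first row (small n only)."""
    if len(m) == 1:
        return m[0][0]
    total = Fraction(0)
    for c in range(len(m)):
        minor = [row[:c] + row[c + 1:] for row in m[1:]]
        total += (-1) ** c * m[0][c] * det(minor)
    return total


def reciprocal_elements(a):
    """Return rec[r][s] = (algebraic complement of a[r][s]) / det(a)."""
    n = len(a)
    d = det(a)
    if d == 0:
        raise ValueError("the determinant of the coefficients is zero")
    rec = [[Fraction(0)] * n for _ in range(n)]
    for r in range(n):
        for s in range(n):
            minor = [row[:s] + row[s + 1:] for i, row in enumerate(a) if i != r]
            cofactor = (-1) ** (r + s) * (det(minor) if n > 1 else Fraction(1))
            rec[r][s] = cofactor / d
    return rec


def solve(a, f):
    """Solve sum_v a[k][v] * xi[v] = f[k] by xi[i] = sum_k rec[k][i] * f[k]."""
    n = len(a)
    rec = reciprocal_elements(a)
    return [sum(rec[k][i] * f[k] for k in range(n)) for i in range(n)]


# Hypothetical system, chosen so that the solution is (1, 1, 3).
a = [[Fraction(v) for v in row] for row in [[2, 1, 0], [1, 3, 1], [0, 1, 4]]]
f = [Fraction(3), Fraction(7), Fraction(13)]
xi = solve(a, f)

# Orthogonality relation: sum_k a[r][k] * rec[s][k] is 1 if r == s, else 0.
rec = reciprocal_elements(a)
delta = [[sum(a[r][k] * rec[s][k] for k in range(3)) for s in range(3)]
         for r in range(3)]
```

The Laplace expansion is chosen for transparency rather than speed; for anything beyond a handful of unknowns an elimination method would be preferable.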



The system of equations (24') is equivalent to the original
system (24). The only formal simplification is the specially
simple way in which the terms in $\partial f/\partial x$ occur. But we shall show
that the system (24') has the advantage of being both complete
and Jacobian; it therefore constitutes precisely the system we
are in search of, containing n linear combinations which we
have denoted in equations (27) by the operators $B_i$; the co-
efficients $c_{ik}$ of (27) will be identical with the coefficients $a^{ki}$.

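We shall first show that the system (24') is complete. We can
write it shortly in the form

$$B_i f = 0, \tag{24''}$$

where

$$B_i = \frac{\partial}{\partial x_i} + \Omega_i = \frac{\partial}{\partial x_i} + \sum_{\alpha=1}^{m} X_{\alpha|i}\,\frac{\partial}{\partial u_\alpha}. \tag{28}$$
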

Since the operators B are linear combinations of the A's, it
follows from a theorem proved above on p. 35 that Poisson's
parentheses $(B_i, B_j)f$ are linear combinations of the expressions
$(A_h, A_k)f$ and $A_h f$.

Now since the system (24) is complete, it follows that the
expressions $(A_h, A_k)f$ are in their turn linear combinations of
the expressions Af; so that the operators $(B_i, B_j)$ are seen to be
linear combinations of the A's alone. But the A's are linear
combinations of the B's (since the B's are independent com-
binations of the A's); hence ultimately the operators $(B_i, B_j)$
are linear combinations of the B's. In other words, the system
(24'') also is itself complete.

We can therefore write

$$(B_i, B_j) f = \sum_{l=1}^{n} q_{ijl}\, B_l f, \tag{29}$$

where the coefficients q are analogous to the p of formula (26).

To show that the system (24'') is Jacobian, we must prove
that all the coefficients q vanish. We note that both sides of
equation (29) are linear in the terms $\partial f/\partial x$, $\partial f/\partial u$, and the identity
cannot hold unless the coefficients of the same derivative, e.g.
$\partial f/\partial x_l$, are the same on both sides. We proceed to find these
coefficients. The left-hand side of (29) can be written in the form

$$B_i\Big(\frac{\partial f}{\partial x_j} + \sum_{\alpha=1}^{m} X_{\alpha|j}\,\frac{\partial f}{\partial u_\alpha}\Big) - B_j\Big(\frac{\partial f}{\partial x_i} + \sum_{\alpha=1}^{m} X_{\alpha|i}\,\frac{\partial f}{\partial u_\alpha}\Big).$$

We saw on p. 36 that the result contains no second-order
derivatives; it is therefore unnecessary to apply the operator B
to the derivatives of f, so that the expression in question reduces
to

$$(B_i, B_j) f = \sum_{\alpha=1}^{m} \big(B_i X_{\alpha|j} - B_j X_{\alpha|i}\big)\,\frac{\partial f}{\partial u_\alpha}. \tag{30}$$

As this contains no terms in $\partial f/\partial x$, it follows that the coefficient
of every $\partial f/\partial x_l$ is zero. On the right-hand side the coefficient of the
corresponding term is $q_{ijl}$ (remembering the definition of B);
hence every q = 0, and in consequence

$$(B_i, B_j) f = 0,$$

or the system (24') is Jacobian.

A further remark which will shortly be useful is that from
the vanishing identically of each side of equation (29) and from
equation (30) it follows that the coefficients of the terms in $\partial f/\partial u$
also vanish, or from (30),

$$B_i X_{\alpha|j} - B_j X_{\alpha|i} = 0. \tag{31}$$

11. Integration by means of the associated system. 

Gathering up the foregoing results, we now see that, given a 
system of linear homogeneous partial differential equations of 
the first order, we can find its general integral (if one exists)
by means of the integration of a complete system of total differ-
ential equations.

We have seen how to transform the given system into a com-
plete system (if it is not so already, and provided it contains no
inconsistency). We now note that the Jacobian system (24') 
which we reached as a result of transforming the generic complete 
system (24) for other purposes, is identical with the system (20), 
which originally arose as the system associated with a generic 
system of total differential equations. The important point here 
is that if with the coefficients X belonging to the system (24') 
we construct the system of total differential equations (19), this 
system is completely integrable. 

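In fact, the condition for this is that

$$\frac{\partial X_{\alpha|i}}{\partial x_j} + \sum_{\beta=1}^{m} \frac{\partial X_{\alpha|i}}{\partial u_\beta}\, X_{\beta|j} = \frac{\partial X_{\alpha|j}}{\partial x_i} + \sum_{\beta=1}^{m} \frac{\partial X_{\alpha|j}}{\partial u_\beta}\, X_{\beta|i} \qquad (i, j = 1, 2, \ldots, n;\ \alpha = 1, 2, \ldots, m),$$

and remembering the definition (28) of the operators B, these
can be written shortly in the form

$$B_j X_{\alpha|i} = B_i X_{\alpha|j}.$$

The equations (31) show that the X's obtained from the
system (24) satisfy these conditions.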

Having transformed the given system into the form (24'), 
we need therefore only construct the associated system (19) 
and integrate by the method given in the preceding chapter: 
the most general solution will be obtained in the form 

$$u_\alpha = \varphi_\alpha(x \mid u^0) \qquad (\alpha = 1, 2, \ldots, m).$$

Solving these m equations with respect to the $u^0_\alpha$ we get
m = N - n principal integrals, and constructing any function
whatever of these integrals we have the general integral of the 
given system. 
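
The whole procedure can be followed on a very small case. The system below is a hypothetical one, chosen here only because every step can be written out by hand; the notation $X_{\alpha|i}$, $B_i$ is that of (24') and (28), and the parenthesis is taken as (A, B)f = ABf - BAf.

```latex
% Hypothetical example: N = 3 variables z_1, z_2, z_3 and n = 2 equations.
\begin{align*}
A_1 f &= z_1\frac{\partial f}{\partial z_1} + z_2\frac{\partial f}{\partial z_2}
        + z_3\frac{\partial f}{\partial z_3} = 0, &
A_2 f &= \frac{\partial f}{\partial z_1} = 0, \\
(A_1, A_2) f &= -\frac{\partial f}{\partial z_1} = -A_2 f
  && \text{(complete, but not Jacobian).}
\end{align*}
% Take x_1 = z_1, x_2 = z_2, u_1 = z_3; the determinant of the first two
% columns is -z_2, assumed different from zero.  Solving for the
% derivatives with respect to the x's gives the form (24'):
\begin{align*}
B_1 f &= \frac{\partial f}{\partial x_1} = 0, &
B_2 f &= \frac{\partial f}{\partial x_2}
        + \frac{u_1}{x_2}\frac{\partial f}{\partial u_1} = 0,
\end{align*}
% so that X_{1|1} = 0 and X_{1|2} = u_1 / x_2.  Condition (31) holds:
\begin{equation*}
B_1 X_{1|2} - B_2 X_{1|1}
  = \frac{\partial}{\partial x_1}\Big(\frac{u_1}{x_2}\Big) - 0 = 0.
\end{equation*}
% The associated total differential equation and its integral are
\begin{equation*}
du_1 = \frac{u_1}{x_2}\,dx_2
\quad\Longrightarrow\quad
\frac{u_1}{x_2} = \text{constant},
\end{equation*}
% giving m = N - n = 1 principal integral z_3 / z_2 and the general
% integral F(z_3 / z_2), with F arbitrary.
```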

This systematic method of integration is in theory quite 
general and covers all possible cases, but it is somewhat laborious 
to apply. In practice it is often shorter to integrate the equations 
separately, and then to look for the m common integrals which
certainly exist, when we have ascertained beforehand that we
are dealing with a complete system. The following may be given
as an example.

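Consider the system

$$\begin{aligned}
Af &\equiv x_1\frac{\partial f}{\partial x_1} + x_2\frac{\partial f}{\partial x_2} + x_3\frac{\partial f}{\partial x_3} + x_4\frac{\partial f}{\partial x_4} = 0, \\
Bf &\equiv x_2\frac{\partial f}{\partial x_1} - x_1\frac{\partial f}{\partial x_2} + x_4\frac{\partial f}{\partial x_3} - x_3\frac{\partial f}{\partial x_4} = 0.
\end{aligned} \tag{32}$$
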


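We shall first show that it is not only complete, but also
Jacobian. To show this as shortly as possible, we put

$$A_1 = x_1\frac{\partial}{\partial x_1} + x_2\frac{\partial}{\partial x_2}, \qquad A_2 = x_3\frac{\partial}{\partial x_3} + x_4\frac{\partial}{\partial x_4},$$

$$B_1 = x_2\frac{\partial}{\partial x_1} - x_1\frac{\partial}{\partial x_2}, \qquad B_2 = x_4\frac{\partial}{\partial x_3} - x_3\frac{\partial}{\partial x_4},$$

so that $A = A_1 + A_2$, $B = B_1 + B_2$, and then construct
the alternate function of the two given operators. We get by
successive transformations

$$\begin{aligned}
(A, B)f &= ABf - BAf \\
&= A_1 B_1 f + A_1 B_2 f + A_2 B_1 f + A_2 B_2 f \\
&\quad - B_1 A_1 f - B_1 A_2 f - B_2 A_1 f - B_2 A_2 f \\
&= (A_1, B_1)f + (A_2, B_1)f + (A_1, B_2)f + (A_2, B_2)f.
\end{aligned}$$

Now it can be shown directly that

$$(A_1, B_1)f = 0, \qquad (A_2, B_1)f = 0,$$

and interchanging $x_1, x_2$ and $x_3, x_4$, it follows that

$$(A_2, B_2)f = 0, \qquad (A_1, B_2)f = 0.$$

Hence

$$(A, B)f = 0,$$

which means that the system is Jacobian. It will therefore have
4 - 2 = 2 independent integrals, or rather (cf. p. 40) an infinite
number of pairs of such integrals.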
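
The vanishing of these parentheses can also be checked mechanically. The sketch below assumes that system (32) reads A = x1 d/dx1 + x2 d/dx2 + x3 d/dx3 + x4 d/dx4 and B = x2 d/dx1 - x1 d/dx2 + x4 d/dx3 - x3 d/dx4, which is the reading consistent with the integrals (33a) to (33c) found below. Since all coefficients are linear in the x's, each operator is represented by the matrix of its coefficients, and the coefficient of df/dx_k in (A, B)f, namely A(b_k) - B(a_k), reduces to a matrix commutator. The helper names are illustrative only.

```python
def matmul(p, q):
    """Product of two square matrices given as lists of rows."""
    n = len(p)
    return [[sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(p, q):
    """Entrywise sum of two matrices."""
    return [[a + b for a, b in zip(rp, rq)] for rp, rq in zip(p, q)]


def parenthesis(m_a, m_b):
    """Matrix of (A, B) = AB - BA for operators with linear coefficients.

    A = sum_i (m_a x)_i d/dx_i and B = sum_i (m_b x)_i d/dx_i.  The coefficient
    of df/dx_k in (A, B)f is A(b_k) - B(a_k) = ((m_b m_a - m_a m_b) x)_k.
    """
    ba, ab = matmul(m_b, m_a), matmul(m_a, m_b)
    n = len(m_a)
    return [[ba[i][j] - ab[i][j] for j in range(n)] for i in range(n)]


# Row k holds the coefficient of d/dx_k as a linear form in x1..x4.
A1 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
A2 = [[0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
B1 = [[0, 1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
B2 = [[0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 1], [0, 0, -1, 0]]

zero = [[0] * 4 for _ in range(4)]
partial = [parenthesis(a, b) for a in (A1, A2) for b in (B1, B2)]
is_jacobian = parenthesis(add(A1, A2), add(B1, B2)) == zero
```

The matrix shortcut applies only because the coefficients are linear; for general coefficients the expressions A(b_k) - B(a_k) have to be formed by differentiation.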

To find one such pair, note that the first equation (which is 
of the type considered in the example on p. 43) has as its general 
integral any homogeneous function of degree zero in the variables
$x_1, x_2, x_3, x_4$. We need therefore only find two independent
integrals of the second equation which are homogeneous of 
degree zero. 

Now the system of ordinary differential equations associated
with the second of the equations (32) is

$$\frac{dx_1}{x_2} = \frac{dx_2}{-x_1} = \frac{dx_3}{x_4} = \frac{dx_4}{-x_3}.$$

The equation formed of the first two of these terms can be
integrated immediately, and gives

$$x_1^2 + x_2^2 = a^2; \tag{33a}$$

similarly the other two terms give

$$x_3^2 + x_4^2 = b^2, \tag{33b}$$

where a and b denote constants.

Equating the first and third terms, after substituting in them
for $x_2$ and $x_4$ the expressions given by equations (33a) and (33b),
we get

$$\frac{dx_1}{\sqrt{a^2 - x_1^2}} = \frac{dx_3}{\sqrt{b^2 - x_3^2}},$$

and therefore integrating

$$\sin^{-1}\frac{x_1}{a} - \sin^{-1}\frac{x_3}{b} = c,$$

where c is a third constant.

This last integral can be put in the form

$$\sin^{-1}\frac{x_1}{\sqrt{x_1^2 + x_2^2}} - \sin^{-1}\frac{x_3}{\sqrt{x_3^2 + x_4^2}} = c. \tag{33c}$$

We also get from (33a) and (33b)

$$\frac{x_1^2 + x_2^2}{x_3^2 + x_4^2} = \frac{a^2}{b^2}. \tag{33d}$$

Of the four integrals thus found, the last two, (33c) and
(33d), are homogeneous of degree zero, and are therefore also
integrals of the first equation; and it would be easy to verify
that they are independent. Hence the general integral of the
system of equations (32) is

$$f\left(\sin^{-1}\frac{x_1}{\sqrt{x_1^2 + x_2^2}} - \sin^{-1}\frac{x_3}{\sqrt{x_3^2 + x_4^2}},\ \frac{x_1^2 + x_2^2}{x_3^2 + x_4^2}\right),$$

where f is the symbol of an arbitrary function.
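
A quick numerical check of this result is possible with central differences. The sketch below assumes the same reading of system (32) as the ordinary differential equations dx1/x2 = dx2/(-x1) = dx3/x4 = dx4/(-x3) imply, and it works at a point with x2 > 0 and x4 > 0, where the positive square roots used in the integration apply. The test point, the step size and the sample function w are arbitrary choices made for the illustration.

```python
import math


def u(x1, x2, x3, x4):
    """Integral (33c): difference of the two inverse sines."""
    return math.asin(x1 / math.hypot(x1, x2)) - math.asin(x3 / math.hypot(x3, x4))


def v(x1, x2, x3, x4):
    """Integral (33d): ratio of the two sums of squares."""
    return (x1 ** 2 + x2 ** 2) / (x3 ** 2 + x4 ** 2)


def w(x1, x2, x3, x4):
    """One sample function of u and v; any other choice would do."""
    return math.sin(u(x1, x2, x3, x4)) + v(x1, x2, x3, x4) ** 2


def grad(fn, p, h=1e-6):
    """Central-difference approximation to the partial derivatives of fn at p."""
    g = []
    for i in range(len(p)):
        hi, lo = list(p), list(p)
        hi[i] += h
        lo[i] -= h
        g.append((fn(*hi) - fn(*lo)) / (2 * h))
    return g


def apply_A(fn, p):
    """A f = x1 f_1 + x2 f_2 + x3 f_3 + x4 f_4."""
    return sum(x * d for x, d in zip(p, grad(fn, p)))


def apply_B(fn, p):
    """B f = x2 f_1 - x1 f_2 + x4 f_3 - x3 f_4."""
    x1, x2, x3, x4 = p
    d1, d2, d3, d4 = grad(fn, p)
    return x2 * d1 - x1 * d2 + x4 * d3 - x3 * d4


p = (0.3, 0.8, 0.5, 1.1)  # arbitrary point with x2 > 0 and x4 > 0
residuals = [op(fn, p) for fn in (u, v, w) for op in (apply_A, apply_B)]
```

All six residuals should vanish up to the error of the difference quotients, which is a check at one point and not a proof.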




CHAPTER IV 

Algebraic Foundations of the Absolute 
Differential Calculus 

1. Effect on some analytical entities of a change of variables.

This chapter is devoted to the study of the effect on some
analytical entities of a change of variables. In this first section
we propose to give some examples showing the nature of the
general considerations which will be subsequently established.

Consider n independent variables $x_1, x_2, \ldots, x_n$, which we shall
as usual denote collectively by x, and suppose a transformation
applied to them which leads to another set of n independent
variables $\bar{x}$; it is understood that the transformation used is
reversible, i.e. that the transformation formulae

$$x_i = x_i(\bar{x}_1, \bar{x}_2, \ldots, \bar{x}_n) \qquad (i = 1, 2, \ldots, n) \tag{1}$$

can be solved for the $\bar{x}$'s in the field considered, so that we have
simultaneously the equivalent equations

$$\bar{x}_i = \bar{x}_i(x_1, x_2, \ldots, x_n) \qquad (i = 1, 2, \ldots, n). \tag{1'}$$

The geometrical name for this operation is of course change
of co-ordinates; to fix the ideas, we may take n = 3, so that we
are passing from Cartesian orthogonal co-ordinates x, y, z to
three generic independent combinations of them (curvilinear co-
ordinates) $q_1, q_2, q_3$.

Now suppose that in dealing with a physical, geometrical,
or other question we find that we have to consider not only the
variables x, but a certain aggregate of entities connected with
them. For instance, in a certain region of physical space referred
to Cartesian co-ordinates x, y, z, let the temperature T be defined
at every point; then it is a determinate function of x, y, z. Or
we may suppose that a field of force exists in the given region,
and we shall then have to consider at every point a vector, and
hence its components, i.e. three functions X, Y, Z of x, y, z.
Now change the variables. We have to find some way of expressing
the same quantity or physical phenomenon (temperature, force,
&c.); for this purpose we find that we have to introduce certain
parameters which in the new system of reference will with advan-
tage take the place of those which were more suitable when we
were using Cartesian co-ordinates. These new parameters are
naturally called transforms of the original ones; they are obtained
from them by a law which cannot be assigned a priori, but depends
on the nature of the problem, and in part on suitable conventions.
For instance, in the new system the temperature T will be a
function of $q_1, q_2, q_3$, such that the same temperature belongs
to the same point of space, whether the calculations are made
with the original or with the new variables; hence T as a function
of the q's will be obtained by substituting for x, y, z in T(x, y, z)
their values in terms of $q_1, q_2, q_3$. This kind of behaviour, which
is the simplest we shall have to consider, is called transformation
by invariance; all functions of position which have a value inde-
pendent of the system of co-ordinates chosen are transformed in
this way.

With the components of a vector, in the other example cited,
this does not happen. If in fact, as we may suppose, the vector
has a magnitude and a direction which are independent of the
system of co-ordinates chosen (we shall think of it as being
defined physically as a force), its components, on the contrary,
even when the point considered remains unchanged, change their
values when the frame of reference is changed. This is obvious
in the case of a rotation of Cartesian axes. If, however, the trans-
formation considered is not of this particular kind, we do not
know a priori what to substitute for the projections X, Y, Z
of the vector on the axes of the old system in order to specify
the vector in the new system;¹ i.e. we have to determine the law
of transformation which will meet the needs of the case in ques-
tion. The most suitable criterion to take as a guide in making
our choice is found by introducing, alongside the given vector, a
scalar quantity with a physical significance which is transformed
by invariance. In this case we take two infinitely near points
whose co-ordinates differ by dx, dy, dz; then the work of the
force whose components are X, Y, Z, in passing from one of
these points to the other, will be

$$dW = X\,dx + Y\,dy + Z\,dz; \tag{2}$$

this scalar quantity has a physical significance which is invariant,
and it can therefore be concretely determined. From the mathe-
matical point of view it is an important fact that with any system
of orthogonal axes Oxyz the Cartesian components of the force
are identical with the coefficients of dx, dy, dz in this expression.
Changing to the curvilinear co-ordinates $q_1, q_2, q_3$, we can find
the resulting values of dx, dy, dz by means of the differentials
of the new variables, using the formulae

$$dx = \sum_{i=1}^{3} \frac{\partial x}{\partial q_i}\,dq_i, \quad \text{&c.}$$

The work dW will take the form

$$dW = \sum_{i=1}^{3} \Big(X\frac{\partial x}{\partial q_i} + Y\frac{\partial y}{\partial q_i} + Z\frac{\partial z}{\partial q_i}\Big)\,dq_i,$$

which is analogous to formula (2).

In fact, putting

$$X\frac{\partial x}{\partial q_i} + Y\frac{\partial y}{\partial q_i} + Z\frac{\partial z}{\partial q_i} = Q_i \qquad (i = 1, 2, 3), \tag{3}$$

we get $dW = Q_1\,dq_1 + Q_2\,dq_2 + Q_3\,dq_3$.

¹ We shall see in various parts of Chapter V how the introduction of new
variables $q_1, q_2, q_3$ gives rise geometrically to corresponding co-ordinate surfaces
$q_1$ = constant, $q_2$ = constant, $q_3$ = constant, and co-ordinate lines which are their
intersections. Bearing this in mind, if we proposed to use geometrical criteria
taken from our co-ordinate system in order to specify the elements which deter-
mine a vector, we should find ourselves faced by four possibilities, all equally
acceptable, and with one or another preferable according to circumstances. At
every point, in fact, the tangents to the co-ordinate lines and the normals to the
co-ordinate surfaces form two supplementary trihedra, which are in general
oblique-angled, and therefore distinct; and a vector may be defined either by
its orthogonal projections on, or by its components along, either of these two
trihedra.

The quantities $Q_1, Q_2, Q_3$ here hold the same position as did
X, Y, Z in Cartesian co-ordinates; it therefore seems suitable
to call them the components of the force in the new system of
reference, so that we may say that formula (3) represents the
law of transformation of the components of a vector. This law
is called covariance.
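
Formula (3) can be illustrated numerically. The sketch below takes cylindrical co-ordinates (r, theta, z) as one particular choice of the q's, with x = r cos(theta), y = r sin(theta); the force, the point and the displacement are arbitrary numbers picked for the illustration. It forms the Q's from X, Y, Z and confirms that the work has the same value in both systems.

```python
import math


def jacobian(r, theta, z):
    """Partial derivatives of (x, y, z) with respect to q = (r, theta, z).

    Row a refers to x, y or z; column i refers to q_i.
    """
    return [
        [math.cos(theta), -r * math.sin(theta), 0.0],
        [math.sin(theta), r * math.cos(theta), 0.0],
        [0.0, 0.0, 1.0],
    ]


def covariant_components(force, q):
    """Q_i = X dx/dq_i + Y dy/dq_i + Z dz/dq_i, as in formula (3)."""
    jac = jacobian(*q)
    return [sum(force[a] * jac[a][i] for a in range(3)) for i in range(3)]


q = (2.0, 0.7, -1.0)        # point, in the curvilinear co-ordinates
force = (1.5, -0.4, 3.0)    # Cartesian components X, Y, Z
dq = (1e-3, -2e-3, 5e-4)    # differentials of the new variables

jac = jacobian(*q)
dxyz = [sum(jac[a][i] * dq[i] for i in range(3)) for a in range(3)]

work_cartesian = sum(force[a] * dxyz[a] for a in range(3))   # formula (2)
Q = covariant_components(force, q)
work_curvilinear = sum(Q[i] * dq[i] for i in range(3))
```

The two values of the work agree up to rounding, because both are the same double sum taken in a different order.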

We can also reach this law from a different point of view,
which, however, we shall show in a moment to be really a particular
case of the preceding argument. Consider an invariant function
u(x, y, z); we shall try to find the most convenient law of trans-
formation of its three derivatives $\partial u/\partial x$, $\partial u/\partial y$, $\partial u/\partial z$, which are
evidently functions of x, y, z. A natural course is to consider the three
derivatives $\partial u/\partial q_1$, $\partial u/\partial q_2$, $\partial u/\partial q_3$ as being the expressions which
correspond to them in the new system of reference; these are of
course given by the ordinary formulae

$$\frac{\partial u}{\partial q_i} = \frac{\partial u}{\partial x}\frac{\partial x}{\partial q_i} + \frac{\partial u}{\partial y}\frac{\partial y}{\partial q_i} + \frac{\partial u}{\partial z}\frac{\partial z}{\partial q_i} \qquad (i = 1, 2, 3). \tag{4}$$

If instead we were to assume transformation by invariance,
the three quantities we are considering would represent deri-
vatives of a function only in the original Cartesian system of
reference, while in any other they would in general lose this
special property.

The formulae (4) are evidently a particular case of the formulae
(3), in which the derivatives of a single function u have been
substituted for the components of the generic vector. The real
reason for this is found in the fact that the law of persistence
of the derivatives can also be included as a special case of the
invariance of a linear differential form. As a particular case, we
need merely replace dW (which is not in general an exact differ-
ential) by the total differential du, which may be expressed in
either of the two forms

$$du = \frac{\partial u}{\partial x}\,dx + \frac{\partial u}{\partial y}\,dy + \frac{\partial u}{\partial z}\,dz = \sum_{i=1}^{3} \frac{\partial u}{\partial q_i}\,dq_i.$$

The foregoing remarks will suggest what it is we propose
to do, though naturally this will become clearer as we
proceed.

Given in a certain system of reference (which may be of any 
kind whatever) a set of quantities having a certain significance, 
physical, geometrical, or other, we assign a law of transforma-
tion by means of which a set of quantities having the same
significance is associated with any other system of reference, 
and we are thus led to introduce a set of parameters, collectively 
independent of the system of reference, whether Cartesian or not. 
This is the basis of the conceptual importance and the fertility 
of the considerations which we propose to develop. 

2. m-fold systems. Forms of degree m and m-ply linear 
forms. 

We shall first define a system of order m or m-fold system.
We apply the term to a system of numbers

$$A_{i_1 i_2 \ldots i_m}$$

which are such that a one-to-one correspondence with a specific
law exists between them and the set of m integers $i_1, i_2, \ldots, i_m$,
where each of the i's can take all integral values from 1 to n.
The number of elements of an m-fold system is thus $n^m$, this
being the number of permutations (with repetitions) of n numbers
taken m at a time. It is not necessary that these elements
should be all different.

A system composed of a single number (which may be repre-
sented by a letter without a suffix) may be considered as a system
of order zero. A simple (one-fold) system will be the aggregate
of n elements which can be represented by the notation

$$A_i \qquad (i = 1, 2, \ldots, n);$$

e.g. the set of three components of a vector, for which m = 1,
n = 3.

A double (2-fold) system will be of the type

$$A_{ij} \qquad (i, j = 1, 2, \ldots, n),$$

and will consist of $n^2$ elements; and so on.

A system of order greater than 1 is called symmetrical if all
the elements in it which differ only as to the order of their suffixes
have the same value; e.g. for the case m = 2, if $A_{ij} = A_{ji}$.
A system is called antisymmetrical (skew) if when two suffixes
are interchanged the element changes its sign but not its absolute
value; again for the case m = 2, if $A_{ij} = -A_{ji}$. The n coefficients
u of a generic linear form*

$$\varphi = \sum_{i=1}^{n} u_i x_i$$

constitute a simple system, which is in fact the most general of
its kind, since, given n quantities $u_i$, it is evidently always possible
to consider them as being the coefficients of a linear form.

Consider next a quadratic form, which we may write as

$$\varphi = \sum_{i,j=1}^{n} A_{ij}\, x_i x_j;$$

as the sum includes all permutations of the suffixes two at a time,
the product of $x_i$ and $x_j$ will occur twice, once as $x_i x_j$ and once
as $x_j x_i$, so that the coefficient of the product is $A_{ij} + A_{ji}$. This
is unchanged if i and j are interchanged; hence we see that the
coefficients of a quadratic form constitute a symmetrical double
system, which is the most general possible. But if we wish to
determine a generic (non-symmetrical) double system by means
of the coefficients of a form, a quadric in the independent vari-
ables x is no longer sufficient. We shall now require two different
sets of n independent variables, e.g. the co-ordinates
x and x' of two points between which no a priori relation exists,
and we must construct the expression (bilinear form)

$$F = \sum_{i,j=1}^{n} A_{ij}\, x_i x'_j,$$

which is linear in both the x's and the x''s; the coefficients of this
form are the required arbitrary quantities $A_{ij}$.

More generally, it is easy to see that a generic m-fold system 
is determined by a multilinear form of m groups of variables, 
while the coefficients of a form of degree m constitute the most 
general symmetrical m-fold system. 

* The term form with respect to given arguments (e.g. the independent variables
$x_1, x_2, \ldots, x_n$) means a polynomial homogeneous in those arguments.

3. Invariance, covariance, and contravariance of a simple
system with respect to linear transformations. Dual variables.

We now proceed to examine the laws of transformation of
systems. We shall at first limit our investigation to a linear
change of variables and a simple system $u_1, u_2, \ldots, u_n$.

We shall suppose that we can pass from the variables
x to the new variables $\bar{x}$, and vice versa, by means of the
formulae

$$x_i = \sum_{k=1}^{n} c_{ik}\,\bar{x}_k \qquad (i = 1, 2, \ldots, n), \tag{5}$$

$$\bar{x}_i = \sum_{k=1}^{n} c^{ki}\, x_k \qquad (i = 1, 2, \ldots, n), \tag{5'}$$

where the coefficients c are arbitrary constants whose determinant
is not zero; the second formula follows from the first by applying
Cramer's rule, so that $c^{ki}$ is the reciprocal element of $c_{ki}$ (cf.
p. 54, footnote).

The most obvious hypothesis to make is that the u's are
functions of position which are transformed by invariance
(cf. § 1).

We get a slightly less simple, but remarkable, case if we sup-
pose that the u's are transformed by the same law as the co-
ordinates, in which case the u's will be called contravariants. In
particular, the co-ordinates themselves form a contravariant
simple system.

Next suppose that the u's are the coefficients of a linear
form

$$\varphi = \sum_{i=1}^{n} u_i x_i,$$

and that $\varphi$ is transformed by invariance, i.e. by substituting for
the x's the expressions (5), so that $\varphi$ is also a linear form of the
new variables $\bar{x}$. We shall take the coefficients of this new form
as the transforms $\bar{u}$ of the u's; we shall then say that the u's
form a covariant system.

Writing out the expressions in full, we have

$$\varphi = \sum_{i=1}^{n} u_i x_i = \sum_{i=1}^{n} u_i \sum_{k=1}^{n} c_{ik}\,\bar{x}_k = \sum_{k=1}^{n} \bar{x}_k \sum_{i=1}^{n} c_{ik}\, u_i.$$

The new coefficients are therefore

$$\bar{u}_k = \sum_{i=1}^{n} c_{ik}\, u_i.$$

Interchanging i and k, so as to get the formulae in the same
shape as (5'), we get

$$\bar{u}_i = \sum_{k=1}^{n} c_{ki}\, u_k \qquad (i = 1, 2, \ldots, n),$$

which gives the law of covariance.

Here, too, we naturally add the equivalent formulae, which
are obtained by solving for the original elements u, and are given
by the usual formula (Cramer's rule). Writing them first, so
that we get them in the order corresponding to that of (5) and
(5'), we have finally the law of covariance expressed by the two
groups of equivalent formulae

$$u_i = \sum_{k=1}^{n} c^{ik}\,\bar{u}_k \qquad (i = 1, 2, \ldots, n), \tag{6}$$

$$\bar{u}_i = \sum_{k=1}^{n} c_{ki}\, u_k \qquad (i = 1, 2, \ldots, n). \tag{6'}$$

We shall frequently consider, together with the variables x
(which are also called point variables), a system of covariant
variables u (called dual variables); the behaviour of both sets
of variables when a linear change of variables is made is shown
by formulae (5) and (6).
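
The invariance that defines the law of covariance is easy to test in integers. In the sketch below the matrix c and the values of the variables are arbitrary illustrative choices (the determinant of c is 5, so the transformation is reversible); the old point variables are obtained from the new ones by the law x_i = sum_k c_ik xbar_k, the new dual variables from the old by ubar_i = sum_k c_ki u_k, and the linear form takes the same value in both sets of variables.

```python
# c[i][k] plays the part of the constant coefficients c_ik; determinant = 5.
c = [
    [2, 1, 0],
    [0, 1, 3],
    [1, 0, 1],
]
n = len(c)

x_bar = [1, -2, 4]   # new point variables (arbitrary values)
u = [3, 5, -1]       # old dual variables (arbitrary values)

# Point variables: x_i = sum_k c_ik * xbar_k.
x = [sum(c[i][k] * x_bar[k] for k in range(n)) for i in range(n)]

# Dual variables, law of covariance: ubar_i = sum_k c_ki * u_k.
u_bar = [sum(c[k][i] * u[k] for k in range(n)) for i in range(n)]

# The linear form phi = sum_i u_i x_i, evaluated in each set of variables.
phi_old = sum(u[i] * x[i] for i in range(n))
phi_new = sum(u_bar[i] * x_bar[i] for i in range(n))
```

Note that the x's are transformed with the rows of c and the u's with its columns, which is exactly what keeps the form invariant.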

To find a geometrical interpretation of dual variables, we may
fix our attention on the case n = 4, in which $x_1, x_2, x_3, x_4$ can
be considered as homogeneous Cartesian co-ordinates of the
points of space. A plane has an equation of the type

$$u_1 x_1 + u_2 x_2 + u_3 x_3 + u_4 x_4 = 0, \tag{7}$$

where with the usual terminology, the coefficients $u_1, u_2, u_3, u_4$
are Plücker's co-ordinates of the plane. Now, given the geometrical
significance of equation (7), its left-hand side must be invariant
(except for a non-essential factor, the co-ordinates being homo-
geneous), and hence the Plücker's co-ordinates u must be trans-
formable by covariance. From the well-known law of duality of
projective geometry the u's have been given the name of dual
variables. Analogous results hold for any value of n.

4. Invariance, covariance, and contravariance of an m-fold 
system with respect to linear transformations. Mixed systems or 
tensors. Vanishing of a tensor an invariant property. 

We shall now extend the discussion of the preceding section 
to systems of any order, but still limiting it to the case of linear 
transformations of the type (5), (5'). We thus define mixed 
systems, of which covariant and contravariant systems are 
particular cases. 

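Consider m sets of n point variables (i.e. m points). Denoting
by an upper index the ordinal number of each point, we get the
set of arguments

$$\begin{matrix}
x^1_1, & x^1_2, & \ldots, & x^1_n; \\
x^2_1, & x^2_2, & \ldots, & x^2_n; \\
\ldots & \ldots & \ldots & \ldots \\
x^m_1, & x^m_2, & \ldots, & x^m_n.
\end{matrix}$$

Consider also a certain number $\mu$ of sets of n dual variables

$$\begin{matrix}
u^1_1, & u^1_2, & \ldots, & u^1_n; \\
u^2_1, & u^2_2, & \ldots, & u^2_n; \\
\ldots & \ldots & \ldots & \ldots \\
u^\mu_1, & u^\mu_2, & \ldots, & u^\mu_n.
\end{matrix}$$

Construct a multilinear form F in all these variables, each
term of F containing as factor an element taken from every set
of the m x's and the $\mu$ u's. The coefficients of these terms, which
are a priori completely arbitrary, will constitute a generic system
of order m + $\mu$. Writing the indices corresponding to the x's
below and those corresponding to the u's above, we shall have

$$F = \sum_{i_1, \ldots, i_m = 1}^{n} \; \sum_{j_1, \ldots, j_\mu = 1}^{n} A^{j_1 \ldots j_\mu}_{i_1 \ldots i_m}\; x^1_{i_1} \cdots x^m_{i_m}\; u^1_{j_1} \cdots u^\mu_{j_\mu}. \tag{8}$$
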


Now transforming the x's by the law of contravariance and
the u's by the law of covariance, and substituting the expressions
so obtained in (8) (i.e. transforming F by invariance), we shall
get a multilinear form of the new variables $\bar{x}$, $\bar{u}$; we shall take
the coefficients $\bar{A}$ of this new form as being the transforms of
the coefficients A. We shall then say that the A's constitute
a tensor or mixed system, covariant with respect to the lower
indices, contravariant with respect to the upper. In particular,
m or $\mu$ may be zero, leading to the absence from F of the point
or dual variables respectively; then the system of coefficients is
purely contravariant if the variables in F are all covariant, and
vice versa.

The case of the simple system comes at once under this 
definition. In fact, F in this case becomes the $\phi$ of the preceding
section; if we consider it as linear in the x's, we find that the
coefficients u, according to the definition just given, must be
called covariant; while if it is considered as linear in the u's,
we conclude that the x's form a contravariant system, which
agrees with the definitions already assigned.

A covariant, contravariant, or mixed tensor, having $m + \mu$
indices in all, is said to be of rank $m + \mu$; a simple system, either
covariant or contravariant (i.e. a tensor of rank 1) is also called
a vector, and its elements are called respectively covariant or
contravariant components of the vector.

Following a similar method to that used in the preceding
section to find the formulae (6) and (6'), we could find the general
transformation formulae for mixed systems, and hence, in
particular, the formulae for contravariant and covariant systems.
We shall not need these formulae, as in what follows we shall
always go back directly to the definition just given. As an
example, however, we propose to find them and give them in
full for the simplest case of the mixed system, i.e. the system
with a single index each of covariance and of contravariance.

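Consider therefore the bilinear form

$$F = \sum_{i,j=1}^{n} A_i^j\, x_i\, u_j,$$

and transform it by invariance. Using formulae (5) and (6), we
get

$$F = \sum_{i,j=1}^{n} A_i^j \sum_{h=1}^{n} c_{ih}\, \bar x_h \sum_{k=1}^{n} c^{jk}\, \bar u_k = \sum_{h,k=1}^{n} \bar x_h\, \bar u_k \sum_{i,j=1}^{n} c_{ih}\, c^{jk} A_i^j.$$

The coefficients of this new form are

$$\bar A_h^k = \sum_{i,j=1}^{n} c_{ih}\, c^{jk} A_i^j, \tag{9}$$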


which gives the law of transformation for mixed systems with 
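two indices. We should get similarly, for the most general
mixed system,

$$\bar A^{k_1 \dots k_\mu}_{h_1 \dots h_m} = \sum_{i_1 \dots i_m,\, j_1 \dots j_\mu = 1}^{n} A^{j_1 \dots j_\mu}_{i_1 \dots i_m}\, c_{i_1 h_1} \dots c_{i_m h_m}\, c^{j_1 k_1} \dots c^{j_\mu k_\mu}. \tag{10}$$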


As a memoria technica, we may add that the transformation
formulae for the x's and the u's give an easy way of remembering
those for a tensor of any kind. The latter are always linear, and
the coefficients are composed of the c's in a similar way to those
of (5) and (6): to each index of covariance corresponds a c with
the indices below, to each index of contravariance a c with the
indices above. The opposite holds in the inverse formulae.
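
The rule just stated can be tried on numbers. The following is only a minimal sketch in Python, under assumptions that are not in the text: n = 2, an arbitrarily chosen set of coefficients, the convention that the old variables are given by $x_i = \sum_k c_{ik} \bar x_k$ and $u_i = \sum_k c^{ik} \bar u_k$ (the $c^{ik}$ being the reciprocal elements of the $c_{ik}$), and the law $\bar A_h^k = \sum c_{ih} c^{jk} A_i^j$ for the mixed double system. The names `c_low`, `c_up`, `transform_mixed` and `bilinear` are illustrative only.

```python
from fractions import Fraction as Fr

n = 2
# Assumed coefficients c_{ik} of the substitution x_i = sum_k c_{ik} xbar_k.
c_low = [[Fr(2), Fr(1)],
         [Fr(1), Fr(1)]]
det = c_low[0][0] * c_low[1][1] - c_low[0][1] * c_low[1][0]
# Reciprocal elements c^{ik}: algebraic complement of c_{ik} divided by det.
c_up = [[ c_low[1][1] / det, -c_low[1][0] / det],
        [-c_low[0][1] / det,  c_low[0][0] / det]]

def transform_mixed(A):
    """Abar_h^k = sum_{i,j} A_i^j c_{ih} c^{jk}; arrays are indexed [lower][upper]."""
    return [[sum(A[i][j] * c_low[i][h] * c_up[j][k]
                 for i in range(n) for j in range(n))
             for k in range(n)] for h in range(n)]

def bilinear(A, x, u):
    """F = sum_{i,j} A_i^j x_i u_j."""
    return sum(A[i][j] * x[i] * u[j] for i in range(n) for j in range(n))

A = [[Fr(3), Fr(-1)], [Fr(4), Fr(5)]]   # arbitrary mixed system A_i^j
x_bar = [Fr(1), Fr(-2)]                  # arbitrary new point variables
u_bar = [Fr(3), Fr(7)]                   # arbitrary new dual variables

# Old variables expressed by the new ones: x through c_{ik}, u through c^{ik}.
x = [sum(c_low[i][k] * x_bar[k] for k in range(n)) for i in range(n)]
u = [sum(c_up[i][k] * u_bar[k] for k in range(n)) for i in range(n)]

F_old = bilinear(A, x, u)
F_new = bilinear(transform_mixed(A), x_bar, u_bar)
print(F_old == F_new)   # the form keeps its value, i.e. F is invariant
```

Exact rational arithmetic is used so that the two values of the form can be compared for strict equality rather than to within rounding.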

We may sum up the discussion so far in the following
definitions.

An m-FOLD COVARIANT is an m-fold system which is transformed
in the same way as the coefficients of a multilinear form in point
variables; an m-FOLD CONTRAVARIANT is one which is transformed
in the same way as the coefficients of a multilinear form in dual
variables; more generally, a MIXED SYSTEM or TENSOR is one which
is transformed in the same way as the coefficients of a multilinear
form in both point and dual variables (including also as particular
cases both purely covariant and purely contravariant systems).

The indices of contravariance are generally written above,
those of covariance below; an exception is however made for
the variables x, which are as usual denoted by $x_1, x_2, \dots x_n$,
with the indices below, even if, as in the present case, we are
dealing with a contravariant system and linear transformations.

We shall close this section with a remark which is as obvious 
as it is fundamental whenever the notion of a tensor occurs. 
This is the fact that if all the elements of a tensor, with reference 
to a certain system of variables, vanish, this necessarily also 
happens for the transformed elements which correspond to any 
linear change of variables whatever. This is an immediate
consequence of the fact that the hypothesis makes the invariant
form F vanish identically.

5. Symmetrical double systems.

Since we shall have occasion later on to deal with a remarkable
symmetrical covariant double system, we propose to give here
some properties of systems of this kind. Let the elements of such
a system be

$$a_{ik} = a_{ki}; \tag{11}$$

their covariance will be expressed by the fact that the bilinear
form

$$F(x \mid x') = \sum_{i,k=1}^{n} a_{ik}\, x_i\, x'_k \tag{12}$$

is invariant in any linear transformation which changes the $x$'s
and the $x'$'s into other sets of variables $\bar x$, $\bar x'$.

We shall first show that such a change of variables leaves
the symmetry of the system unchanged; i.e. that

$$\bar a_{ik} = \bar a_{ki}. \tag{13}$$

In fact, if we interchange the variables x, x' in the bilinear
form (12), we get

$$F(x' \mid x) = \sum_{i,k=1}^{n} a_{ik}\, x'_i\, x_k,$$

and since, on account of (11), the right-hand side of this equation
differs from that of equation (12) only by the non-essential
interchange of the letters i and k, it follows that

$$F(x' \mid x) = F(x \mid x'). \tag{11'}$$

Vice versa, if this relation holds, we conclude, by reversing
the steps of the argument, that (11) is also true.

Hence the condition of symmetry (11) is completely equivalent
to the condition (11'). From this standpoint it is easily seen to
be invariant. In fact, changing the variables, and denoting for
shortness

$$F\bigl(x(\bar x) \mid x'(\bar x')\bigr) \quad \text{by} \quad \bar F(\bar x \mid \bar x'),$$

equation (11') changes to the equality

$$\bar F(\bar x' \mid \bar x) = \bar F(\bar x \mid \bar x'),$$

which, as we have just seen, is equivalent to (13).

We could show in the same way that if a contravariant double
system is symmetrical with respect to one system of co-ordinates,
it is still symmetrical after any linear change of variables; a
mixed system $a_i^k$, however, has not this property. For an
antisymmetrical double system, either covariant or contravariant,
we could also show similarly that antisymmetry is an invariant
property.
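
These three statements can be illustrated numerically. The sketch below is not from the text; it assumes n = 2, an arbitrary substitution with coefficients $c_{ik}$, the covariant law $\bar a_{hk} = \sum a_{ij} c_{ih} c_{jk}$ and the mixed law $\bar a_h^k = \sum a_i^j c_{ih} c^{jk}$, the $c^{jk}$ being reciprocal elements. All function names are hypothetical.

```python
from fractions import Fraction as Fr

n = 2
c_low = [[Fr(2), Fr(1)], [Fr(1), Fr(1)]]          # assumed c_{ik}
det = c_low[0][0] * c_low[1][1] - c_low[0][1] * c_low[1][0]
c_up = [[ c_low[1][1] / det, -c_low[1][0] / det],  # reciprocal elements c^{ik}
        [-c_low[0][1] / det,  c_low[0][0] / det]]

def transform_covariant(a):
    """abar_{hk} = sum_{i,j} a_{ij} c_{ih} c_{jk}."""
    return [[sum(a[i][j] * c_low[i][h] * c_low[j][k]
                 for i in range(n) for j in range(n))
             for k in range(n)] for h in range(n)]

def transform_mixed(a):
    """abar_h^k = sum_{i,j} a_i^j c_{ih} c^{jk}."""
    return [[sum(a[i][j] * c_low[i][h] * c_up[j][k]
                 for i in range(n) for j in range(n))
             for k in range(n)] for h in range(n)]

def is_symmetric(a):
    return all(a[i][k] == a[k][i] for i in range(n) for k in range(n))

def is_antisymmetric(a):
    return all(a[i][k] == -a[k][i] for i in range(n) for k in range(n))

sym = [[Fr(1), Fr(2)], [Fr(2), Fr(3)]]     # symmetric array of numbers
anti = [[Fr(0), Fr(1)], [Fr(-1), Fr(0)]]   # antisymmetric array of numbers

print(is_symmetric(transform_covariant(sym)))       # symmetry is kept
print(is_antisymmetric(transform_covariant(anti)))  # antisymmetry is kept
print(is_symmetric(transform_mixed(sym)))           # lost for a mixed system
```

The same array `sym` is used for the covariant and for the mixed case, so that the only difference between the two results is the law of transformation.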

We can now use the property just illustrated to establish
the covariance of the coefficients of an invariant quadratic form.
Let the quadratic form be

$$\phi(x) = \sum_{i,k=1}^{n} a_{ik}\, x_i\, x_k. \tag{14}$$

Changing the variables, $\phi(x)$ evidently becomes a quadratic
form in the $\bar x$'s, which we shall write

$$\bar\phi(\bar x) = \sum_{i,k=1}^{n} \bar a_{ik}\, \bar x_i\, \bar x_k. \tag{14'}$$

We shall show that the coefficients $\bar a_{ik}$ are the transforms by
covariance of the coefficients $a_{ik}$, or in other words are the same
as would be obtained by changing the variables in $F(x \mid x')$.
In fact, we get $\phi(x)$ from $F(x \mid x')$ by first putting x' equal to x,
or

$$\phi(x) = F(x \mid x),$$

and from this, with the usual change of variables, we then get
$\bar\phi(\bar x)$, which is thus derived from $F(x \mid x')$ by applying successively
the two operations

$$x'_i = x_i, \tag{a}$$

$$x_i = x_i(\bar x). \tag{b}$$

But the same result will obviously be obtained if these two
operations are applied in inverse order, i.e. if we pass first from
$F(x \mid x')$ to $\bar F(\bar x \mid \bar x')$ (the coefficients of which are by definition
the transforms by covariance of the coefficients $a_{ik}$), and then,
by the operation (a), which implies $\bar x' = \bar x$ and on account of
symmetry does not change the coefficients, to $\bar\phi(\bar x)$; the
coefficients of this last expression are therefore the transforms by
covariance of the coefficients $a_{ik}$.

6. Sets of n covariant and contravariant simple systems.
Theorem on reciprocal sets.

We now propose to prove a lemma in which we shall have to
consider, not a single simple system, but a set of n covariant
simple systems. We must therefore distinguish the elements in
question by two indices, one showing the ordinal number of the
system from which an element is taken, the other (which will
be an index of covariance or of contravariance) showing the
ordinal number of the element in that system. Consider,
therefore, the set of n covariant simple systems

$$\lambda_{a|i} \qquad (a, i = 1, 2, \dots n), \tag{15}$$

where a represents the ordinal number of the system and is
therefore not an index of either covariance or contravariance;
and suppose further that the determinant of the $\lambda$'s does not
vanish, or in other words that the n systems are independent.
With this hypothesis, to every element $\lambda_{a|i}$ will correspond a
reciprocal element (its algebraic complement or minor divided
by the value of the determinant), which we shall denote by

$$\lambda_a^i \qquad (a, i = 1, 2, \dots n). \tag{15'}$$

In a linear change of variables the terms $\lambda_{a|i}$ will be transformed
by the law of covariance, and the transforms will be denoted by
$\bar\lambda_{a|i}$; we shall take the reciprocal elements $\bar\lambda_a^i$ of the terms $\bar\lambda_{a|i}$
as representing the transforms of the reciprocal elements $\lambda_a^i$.

We shall now show that this law is identical with
contravariance, i.e. that giving a the values 1, 2, ... n, the terms $\lambda_a^i$
constitute n contravariant simple systems; or shortly, that the
reciprocal set of n covariant systems is a set of n contravariant
systems. This is the reason for placing the index i above.

The hypothesis of covariance of the set of n systems (15)
means that the n linear forms

$$\psi_a = \sum_{i=1}^{n} \lambda_{a|i}\, x_i \qquad (a = 1, 2, \dots n)$$

are invariant. What we have to prove is that the n linear forms

$$\phi_a = \sum_{i=1}^{n} \lambda_a^i\, u_i \qquad (a = 1, 2, \dots n)$$

are also invariant, i.e. that

$$\bar\phi_a - \phi_a = \sum_{i=1}^{n} \bar\lambda_a^i\, \bar u_i - \sum_{i=1}^{n} \lambda_a^i\, u_i = 0 \qquad (a = 1, 2, \dots n) \tag{16}$$

whatever the u's may be. Now these last expressions are linear
in the u's (since the $\bar u$'s are merely linear combinations of the
u's), so that each of them is of the type

$$\sum_{i=1}^{n} \xi^i\, u_i.$$

To show that this vanishes identically (i.e. that all the $\xi$'s are
zero) we need only show that it vanishes when we give the u's
n distinct sets of numerical values, as we shall then have n
homogeneous linear equations in the $\xi$'s, whose determinant does not
vanish (this condition being implied by the use just now of the
adjective distinct). We shall give the u's the values

$$\lambda_{\beta|i} \qquad (\beta, i = 1, 2, \dots n),$$

and hence, from the covariance of these quantities, we shall have
to give the $\bar u$'s the values $\bar\lambda_{\beta|i}$. Using a property of determinants
(given in the footnote on p. 65), and substituting
in equation (16), we get

$$\bar\phi_a - \phi_a = 0 \qquad (a, \beta = 1, 2, \dots n),$$

which proves the result required.
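
The theorem on reciprocal sets lends itself to a direct numerical trial. What follows is a sketch under stated assumptions, not the author's procedure: n = 2, arbitrary numbers for the $\lambda_{a|i}$ and for the coefficients $c_{ik}$, the covariant law taken as $\bar\lambda_{a|i} = \sum_k c_{ki} \lambda_{a|k}$, and the contravariant law as $\bar\xi^i = \sum_k c^{ki} \xi^k$. The helper `reciprocal` and the other names are hypothetical.

```python
from fractions import Fraction as Fr

n = 2

def reciprocal(m):
    """Reciprocal elements of a 2x2 array: algebraic complement / determinant."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[ m[1][1] / det, -m[1][0] / det],
            [-m[0][1] / det,  m[0][0] / det]]

c_low = [[Fr(2), Fr(1)], [Fr(1), Fr(1)]]   # assumed c_{ik}
c_up = reciprocal(c_low)                   # c^{ik}

lam = [[Fr(1), Fr(2)], [Fr(3), Fr(5)]]     # lam[a][i] = lambda_{a|i}
lam_rec = reciprocal(lam)                  # lam_rec[a][i] = lambda_a^i

# Covariant law applied to each system a separately.
lam_bar = [[sum(c_low[k][i] * lam[a][k] for k in range(n))
            for i in range(n)] for a in range(n)]

# Transforms of the reciprocal elements as defined in the text:
# the reciprocal elements of the transformed set.
rec_of_bar = reciprocal(lam_bar)

# Contravariant law applied directly to the lambda_a^i.
rec_by_law = [[sum(c_up[k][i] * lam_rec[a][k] for k in range(n))
               for i in range(n)] for a in range(n)]

print(rec_of_bar == rec_by_law)   # the two procedures agree
```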

7. Addition of tensors. 

Take two tensors (in general mixed) of the same kind, i.e.
having the same number of indices of covariance and the same
number of indices of contravariance (in particular, two covariant,
or two contravariant, systems of the same order):

$$A^{j_1 \dots j_\mu}_{i_1 \dots i_m}, \qquad B^{j_1 \dots j_\mu}_{i_1 \dots i_m}.$$

Summing corresponding elements (those with the same indices)
we get a new system whose general term is

$$A^{j_1 \dots j_\mu}_{i_1 \dots i_m} + B^{j_1 \dots j_\mu}_{i_1 \dots i_m},$$

depending on the same number of indices. We shall show that
this new system is also a tensor, covariant and contravariant
respectively with respect to the indices of covariance and
contravariance of the given systems, so that with the notation previously
adopted the general term can be written

$$C^{j_1 \dots j_\mu}_{i_1 \dots i_m}.$$

To simplify the formulae we shall prove the result for the
case of a single index each of covariance and of contravariance;
the reasoning is identical in the general case. Our hypothesis
then is that the forms

$$F = \sum_{i,j=1}^{n} A_i^j\, x_i\, u_j, \qquad \Phi = \sum_{i,j=1}^{n} B_i^j\, x_i\, u_j$$

are invariants. The sum

$$F + \Phi = \sum_{i,j=1}^{n} (A_i^j + B_i^j)\, x_i\, u_j = \sum_{i,j=1}^{n} C_i^j\, x_i\, u_j$$

will therefore also be an invariant, which is as much as to say
that the system

$$C_i^j = A_i^j + B_i^j$$

is covariant with respect to the lower index, and contravariant
with respect to the upper.

The tensor C is called the sum of the two tensors A and B.

8. Multiplication of tensors. 

We shall now define the product of two tensors. These may be
of any kind, in general mixed; we shall suppose that one has m
indices of covariance and $\mu$ of contravariance, and the other m'
and $\mu'$ respectively, so that they are represented by

$$A^{j_1 \dots j_\mu}_{i_1 \dots i_m}, \qquad B^{j_1 \dots j_{\mu'}}_{i_1 \dots i_{m'}}.$$

Construct the system whose general term is the product of
any element A by any element B; the element so formed will
contain $m + \mu + m' + \mu'$ indices, so that the rank of the
product system will be the sum of the ranks of the given systems.
We shall show that it is a tensor which has the $m + m'$ indices of
covariance of the given systems as indices of covariance, and the
$\mu + \mu'$ indices of contravariance as indices of contravariance.
To simplify the formulae we shall as before consider the case of
only two indices.

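Let the two forms which by hypothesis are invariant be

$$F = \sum_{i,h=1}^{n} A_i^h\, x_i\, u_h, \qquad \Phi = \sum_{j,k=1}^{n} B_j^k\, x'_j\, u'_k.$$

Their product will also be an invariant, and is

$$F\Phi = \sum_{i,j,h,k=1}^{n} A_i^h B_j^k\, x_i\, x'_j\, u_h\, u'_k,$$

or, putting

$$A_i^h B_j^k = C_{ij}^{hk},$$

$$F\Phi = \sum_{i,j,h,k=1}^{n} C_{ij}^{hk}\, x_i\, x'_j\, u_h\, u'_k.$$

The invariance of this form means that the indices i and j
attached to the letter C are indices of covariance, and h and k
are indices of contravariance, which proves the statement just
made. The argument is the same in the general case.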

9. Contraction of tensors. 

We shall now define the operation of contraction, by which 
we pass from any mixed system to another system having one 
index of covariance and one of contravariance less than the 
first. 

For convenience of printing, we shall give explicitly only
one of the indices of covariance and one of contravariance,
replacing the others by points, so that we shall put

$$A^{\cdots s}_{\cdots r}$$

to represent the general term.

Now construct the system

$$B^{\cdots}_{\cdots} = \sum_{r=1}^{n} A^{\cdots r}_{\cdots r},$$

which will contain all the indices, except the two shown on the
right; we say then that the tensor has been contracted with
respect to these two indices. We shall show that the system
so obtained is also a tensor, having the same indices of covariance
and of contravariance (except of course the pair used in
contracting) as the given tensor. To simplify the formulae, we shall
as usual consider a particular case, but one not differing essentially
from the general case. Suppose, therefore, that the form

$$F = \sum_{i,r,h,s=1}^{n} A_{ir}^{hs}\, x_i\, x'_r\, u_h\, u'_s$$

is invariant whatever may be the variables x, x', u, u', the only
restriction being that x, x' are point variables and u, u' dual
variables. Their values being arbitrary, we may replace the
variables $u'_s$ by n distinct systems of covariant quantities, which
we shall denote by $\lambda_{a|s}$, using the notation (15) of § 6; we can
then replace the variables $x'_r$ by the quantities $\lambda_a^r$ which are
the reciprocal elements of the former group, and therefore
contravariant (§ 6). We shall thus have the n forms

$$F_a = \sum_{i,r,h,s=1}^{n} A_{ir}^{hs}\, x_i\, \lambda_a^r\, u_h\, \lambda_{a|s} \qquad (a = 1, 2, \dots n)$$

all invariant. Their sum G will therefore also be invariant.
Writing out this sum, and making some slight transformations,
we get (remembering the fundamental property of reciprocal
elements)

$$G = \sum_{a=1}^{n} F_a = \sum_{i,h=1}^{n} x_i\, u_h \sum_{r,s=1}^{n} A_{ir}^{hs} \sum_{a=1}^{n} \lambda_a^r\, \lambda_{a|s}.$$

Now we know that $\sum_a \lambda_a^r \lambda_{a|s} = 0$ if $r \neq s$ and $= 1$ if $r = s$, hence
all the terms in the sum for which $r \neq s$ will disappear, and there
remains

$$G = \sum_{i,h=1}^{n} x_i\, u_h \sum_{r=1}^{n} A_{ir}^{hr}.$$

The invariance of this form shows, as was required, that the
system

$$B_i^h = \sum_{r=1}^{n} A_{ir}^{hr}$$

is a tensor covariant with respect to the i's and contravariant
with respect to the h's.

The operation of contraction can evidently be repeated several
times, contracting successively with respect to various pairs of
indices, so that, for example, from the system

$$A_{ij}^{hks}$$

we can pass, by using two pairs of indices, to the tensor

$$B^h = \sum_{i,j=1}^{n} A_{ij}^{hij}.$$

If the process is applied to the only pair of indices of a mixed
double system, the result is an invariant:

$$A = \sum_{i=1}^{n} A_i^i.$$
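
The last statement is easily tried on numbers. The sketch below assumes, beyond the text, n = 2, arbitrary coefficients $c_{ik}$ with reciprocal elements $c^{ik}$, and the law $\bar A_h^k = \sum c_{ih} c^{jk} A_i^j$ for a mixed double system; `contract` and `transform_mixed` are illustrative names.

```python
from fractions import Fraction as Fr

n = 2
c_low = [[Fr(2), Fr(1)], [Fr(1), Fr(1)]]          # assumed c_{ik}
det = c_low[0][0] * c_low[1][1] - c_low[0][1] * c_low[1][0]
c_up = [[ c_low[1][1] / det, -c_low[1][0] / det],  # reciprocal elements c^{ik}
        [-c_low[0][1] / det,  c_low[0][0] / det]]

def transform_mixed(A):
    """Abar_h^k = sum_{i,j} A_i^j c_{ih} c^{jk}; arrays are indexed [lower][upper]."""
    return [[sum(A[i][j] * c_low[i][h] * c_up[j][k]
                 for i in range(n) for j in range(n))
             for k in range(n)] for h in range(n)]

def contract(A):
    """Contraction of a mixed double system: sum_i A_i^i."""
    return sum(A[i][i] for i in range(n))

A = [[Fr(3), Fr(-1)], [Fr(4), Fr(5)]]   # arbitrary mixed double system
A_bar = transform_mixed(A)

print(contract(A), contract(A_bar))     # the same number before and after
```

The individual elements of `A_bar` differ from those of `A`; only the contracted sum is unchanged.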

10. Composition of tensors. 

If we combine the operation of multiplication of two tensors
with that of contraction, we get the operation called composition
(or inner multiplication) of two tensors. We shall write the two
tensors in the abridged form

$$A^{\cdots}_{\cdots r}, \qquad B^{\cdots s}_{\cdots},$$

where we show only a single index of covariance for one and of
contravariance for the other.

The tensor

$$C^{\cdots}_{\cdots} = \sum_{r=1}^{n} A^{\cdots}_{\cdots r}\, B^{\cdots r}_{\cdots}$$

is said to be compounded of the first two or is called their inner
product; its indices of covariance are those of A, except r, and
all those of B, and its indices of contravariance are all those of
A, and those of B, except s.

It can at once be seen that the system C is a tensor, observing
that it is obtained by contraction with respect to the indices
r and s from the system

$$\Gamma^{\cdots s}_{\cdots r} = A^{\cdots}_{\cdots r}\, B^{\cdots s}_{\cdots},$$

which is itself obtained by taking the product of the given
systems. Thus, for instance, compounding the systems

$$A_{ir}^{h}, \qquad B_{j}^{ks}$$

with respect to the indices r and s, we get

$$C_{ij}^{hk} = \sum_{r=1}^{n} A_{ir}^{h}\, B_{j}^{kr}.$$
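
For the simplest case, that of two mixed double systems, the tensor character of the compound system can be tried numerically. The following is a sketch only, assuming n = 2, arbitrary numerical data, and the law $\bar A_h^k = \sum c_{ih} c^{jk} A_i^j$; it compares the transform of $C_i^k = \sum_r A_i^r B_r^k$ with the system compounded from the transforms of A and B. The names are hypothetical.

```python
from fractions import Fraction as Fr

n = 2
c_low = [[Fr(2), Fr(1)], [Fr(1), Fr(1)]]          # assumed c_{ik}
det = c_low[0][0] * c_low[1][1] - c_low[0][1] * c_low[1][0]
c_up = [[ c_low[1][1] / det, -c_low[1][0] / det],  # reciprocal elements c^{ik}
        [-c_low[0][1] / det,  c_low[0][0] / det]]

def transform_mixed(A):
    """Abar_h^k = sum_{i,j} A_i^j c_{ih} c^{jk}; arrays are indexed [lower][upper]."""
    return [[sum(A[i][j] * c_low[i][h] * c_up[j][k]
                 for i in range(n) for j in range(n))
             for k in range(n)] for h in range(n)]

def compose(A, B):
    """C_i^k = sum_r A_i^r B_r^k (product followed by contraction)."""
    return [[sum(A[i][r] * B[r][k] for r in range(n))
             for k in range(n)] for i in range(n)]

A = [[Fr(3), Fr(-1)], [Fr(4), Fr(5)]]
B = [[Fr(1), Fr(0)], [Fr(2), Fr(-1)]]

transform_then_compose = compose(transform_mixed(A), transform_mixed(B))
compose_then_transform = transform_mixed(compose(A, B))

print(transform_then_compose == compose_then_transform)
```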


11. Change of variables in general. m-fold systems whose
elements are functions of position. First general definition of a
tensor. Typical tensors of rank 1.

Up to this point we have considered only linear changes of 
variables, and we have defined, with reference to them, covari- 
ance, contravariance, and the fundamental operations on systems. 
We shall now extend these definitions to any change whatever 
of the variables. 

Suppose, therefore, that the formulae of transformation, instead
of equations (5), are

$$x_i = f_i(\bar x_1, \bar x_2, \dots \bar x_n) \qquad (i = 1, 2, \dots n), \tag{17}$$

where the f's denote arbitrary functions, except for the
qualitative restrictions as to differentiability, &c., which will be
tacitly imposed whenever necessary, and the condition that the
transformation is reversible, i.e. that the equations (17) are
soluble for the $\bar x$'s and can therefore also be given in the equivalent
form

$$\bar x_i = \bar x_i(x_1, x_2, \dots x_n). \tag{17'}$$


The general transformation (17) involves a linear
transformation of the differentials. In fact, putting

$$\frac{\partial x_i}{\partial \bar x_k} = c_{ik}, \qquad \frac{\partial \bar x_k}{\partial x_i} = c^{ik}, \tag{18}$$

we get, differentiating (17) and (17'),

$$dx_i = \sum_{k=1}^{n} \frac{\partial x_i}{\partial \bar x_k}\, d\bar x_k = \sum_{k=1}^{n} c_{ik}\, d\bar x_k, \tag{19}$$

$$d\bar x_i = \sum_{k=1}^{n} \frac{\partial \bar x_i}{\partial x_k}\, dx_k = \sum_{k=1}^{n} c^{ki}\, dx_k \qquad (i = 1, 2, \dots n). \tag{19'}$$

The second of these groups of formulae must be identical with
the group which would result from solving the first; the quantities
$c^{ik}$ must therefore be the reciprocal elements of the quantities
$c_{ik}$, which justifies the choice of these symbols to represent the
derivatives.^

From the analogy of formulae (19), (19') to (5), (5'), we can
at once extend the earlier arguments to m-fold systems whose
elements are any functions of position (i.e. of the independent
variables $x_1, x_2, \dots x_n$). We shall say that an m-fold system
whose elements are functions of position constitutes a tensor,
covariant, contravariant, or mixed, with respect to a generic
transformation (17), when it is a tensor of the specified kind
(at every point of the field considered) with respect to the linear
transformation (19), (19') between the differentials of the old and
the new variables.

In consequence the differentials of the independent variables
provide us with the typical contravariant simple system. We
shall next consider what is the typical covariant simple
system.

In § 3 we introduced the dual variables $u_i$, which were formally
defined as the coefficients of a linear form in the variables x.
These latter are now to be replaced by their differentials dx, so
that we start from a generic Pfaffian

$$\psi = \sum_{i=1}^{n} u_i\, dx_i,$$

and consider it as invariant for any change whatever of the
variables x. The coefficients u are considered as functions of position,
and hence initially of the x's. When the transformation (17)
is made, the dependence on the point co-ordinates is expressed
instead in terms of the new variables $\bar x$. Substituting in $\psi$ for
$dx_i$ from (19), we see in the first place that we still have a Pfaffian
in the new variables $\bar x$; this is obvious, since the original expression
is linear in dx. Writing out the result, we get

$$\psi = \sum_{i=1}^{n} u_i\, dx_i = \sum_{i=1}^{n} u_i \sum_{k=1}^{n} \frac{\partial x_i}{\partial \bar x_k}\, d\bar x_k = \sum_{k=1}^{n} d\bar x_k \sum_{i=1}^{n} u_i \frac{\partial x_i}{\partial \bar x_k}.$$

The coefficients of the new differentials $d\bar x_k$, i.e. the elements
$\bar u_k$ of the system which is the transform of the coefficients $u_i$,
are therefore

$$\bar u_k = \sum_{i=1}^{n} u_i \frac{\partial x_i}{\partial \bar x_k} \qquad (k = 1, 2, \dots n).$$

^ This can also be shown directly, by proving that the terms $c^{ik}$ and $c_{ik}$ have
the fundamental property of reciprocal elements. In fact, if in equations (17)
we replace the $\bar x$'s by the expressions given by equations (17'), they reduce to
identities. Differentiate one of these with respect to $x_k$, using the rule for a
compound function. We shall have

$$\frac{\partial x_i}{\partial x_k} = \sum_{h=1}^{n} \frac{\partial x_i}{\partial \bar x_h} \frac{\partial \bar x_h}{\partial x_k}.$$

Now the left-hand side is 0 or 1 according as $i \neq k$ or $i = k$; on the right-hand
side we can introduce the notation (18), so getting

$$\delta_i^k = \sum_{h=1}^{n} c_{ih}\, c^{kh},$$

which proves the required result.


Interchanging i and k and adopting the notation (18), we get
the law of transformation for the coefficients of a Pfaffian expressed
by the formulae

$$\bar u_i = \sum_{k=1}^{n} c_{ki}\, u_k,$$

which are identical with the formulae (6'). Adding the inverse
formulae and replacing the coefficients c by their values as
given by (18), we get the transformation formulae for the coefficients
of a Pfaffian (an invariant), which constitute the typical simple
covariant system, in the explicit form

$$u_i = \sum_{k=1}^{n} \bar u_k \frac{\partial \bar x_k}{\partial x_i}, \tag{20}$$

$$\bar u_i = \sum_{k=1}^{n} u_k \frac{\partial x_k}{\partial \bar x_i} \qquad (i = 1, 2, \dots n). \tag{20'}$$


Suppose in particular that the (invariant) Pfaffian is the exact
differential of a function u of position; being invariant, u is such
that its expression in terms of the $\bar x$'s is obtained from its
expression in terms of the x's by substituting $f_i(\bar x)$ for $x_i$, and vice
versa, so that the formula

$$u(x) = \bar u(\bar x)$$

is an identity when we substitute in it the expressions given by
(17) (or (17')) for the x's (or the $\bar x$'s).

The coefficients $u_i$ of the Pfaffian are respectively $\partial u / \partial x_i$ or
$\partial \bar u / \partial \bar x_i$ according as du is considered as expressed in terms of
the x's or of the $\bar x$'s.

It follows that the derivatives of an invariant are transformed
by covariance, the law being given by formulae (20), (20').

Vice versa, to obtain the formulae of covariance (20) or (20')
relative to a simple system, without having to go through all
the steps from the beginning or to remember them by heart,
the easiest memoria technica is to consider the elements of the
generic system in question as being for the moment the
derivatives of a single function, and to apply the rule for differentiating
a function of one or more functions. We then automatically get
formulae (20) or (20') according as we start from an original or
a transformed element.

The direct transformation of the differentials further, as we
have seen, gives formulae (19) and (19'), which we can use as
the transformation formulae for a generic contravariant simple
system, by substituting the original elements $\xi^i$ for the differentials
$dx_i$ and the transformed elements $\bar\xi^i$ for the differentials $d\bar x_i$.

To sum up, the differentials of the independent variables and
the derivatives of a single function give what we may call the
pattern of the transformation formulae for simple contravariant
and covariant systems respectively.
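
As an illustration of the covariant pattern, one may take plane polar co-ordinates as the new variables. The sketch below is not part of the text and rests on assumptions: the old variables are Cartesian, $x_1 = r\cos\theta$, $x_2 = r\sin\theta$ with $\bar x_1 = r$, $\bar x_2 = \theta$, and the invariant is chosen arbitrarily as $u = x_1 x_2$. It compares the derivatives of u taken directly in the new variables with those obtained from the old derivatives by the chain rule $\bar u_i = \sum_k u_k\, \partial x_k / \partial \bar x_i$. The function names are hypothetical.

```python
import math

def jacobian(r, theta):
    """J[k][i] = dx_k / dxbar_i for x_1 = r cos(theta), x_2 = r sin(theta)."""
    return [[math.cos(theta), -r * math.sin(theta)],
            [math.sin(theta),  r * math.cos(theta)]]

def grad_old(x1, x2):
    """Derivatives of the assumed invariant u = x_1 x_2 in the old variables."""
    return [x2, x1]

def grad_new_direct(r, theta):
    """Derivatives of ubar = r^2 sin(theta) cos(theta) with respect to r, theta."""
    return [2 * r * math.sin(theta) * math.cos(theta),
            r * r * math.cos(2 * theta)]

def grad_new_by_covariance(r, theta):
    """ubar_i = sum_k u_k dx_k/dxbar_i, the covariant law."""
    x1, x2 = r * math.cos(theta), r * math.sin(theta)
    u = grad_old(x1, x2)
    J = jacobian(r, theta)
    return [sum(u[k] * J[k][i] for k in range(2)) for i in range(2)]

r, theta = 1.7, 0.6
print(grad_new_direct(r, theta))
print(grad_new_by_covariance(r, theta))   # agrees up to rounding
```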


12. Second general definition of tensors whose elements are
functions of position. Examples.

Take a multilinear form in any number of sets of contravariant
variables (i.e. having the same law of transformation as the
$dx_i$) and in any number of sets of covariant variables (i.e. having
the same law of transformation as the $u_i = \partial u / \partial x_i$). Let the
coefficients be considered as functions of position, and the given
form as invariant at each separate point. From the definition
given in the preceding section it is clear that the coefficients
form a mixed tensor, whose indices of covariance are those relative
to the contravariant variables, and vice versa. Reciprocally,
every tensor, in the sense of the first definition, can be identified
with the coefficients of a multilinear form of the kind just
described. The two definitions are therefore completely
equivalent.

From this point everything is analogous to what was said in
§ 4, and we may therefore dispense with further details, except
to repeat once more explicitly the remark made at the end of
§ 4 as to the vanishing of a tensor (i.e. of all its elements) being
an invariant property. The property holds in general for any
change of variables of any kind. In other words, if all the elements
of a generic tensor

$$A^{j_1 \dots j_\mu}_{i_1 \dots i_m},$$

referred to a particular system of variables, are zero, we may
be sure that the equations

$$A^{j_1 \dots j_\mu}_{i_1 \dots i_m} = 0 \qquad (i_1, i_2, \dots i_m,\ j_1, j_2, \dots j_\mu = 1, 2, \dots n)$$

continue to hold however the variables may be changed.

We shall close this section with two examples of tensors 
which occur fairly often. 

Consider first a linear operator

$$A f = \sum_{i=1}^{n} A^i \frac{\partial f}{\partial x_i},$$

whose coefficients $A^i$ are specified functions of position. Let us
treat the operator as an invariant. Then since the terms
$\partial f / \partial x_i$ are covariant, it follows that the $A^i$'s are by definition
contravariant, and must therefore have their law of transformation
given by the equations (19'), so that we get for the transformed
coefficients the expressions

$$\bar A^i = \sum_{k=1}^{n} A^k \frac{\partial \bar x_i}{\partial x_k},$$

as could easily be verified directly.

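Consider next a differential quadratic form

$$\phi = \sum_{i,k=1}^{n} a_{ik}\, dx_i\, dx_k,$$

which is to be invariant; the coefficients $a_{ik}$ (in general to be
considered as functions of position) will then be covariant, and hence
their transformation formulae will be

$$a_{ik} = \sum_{r,s=1}^{n} \bar a_{rs} \frac{\partial \bar x_r}{\partial x_i} \frac{\partial \bar x_s}{\partial x_k}, \tag{21}$$

or (solving for the transformed elements)

$$\bar a_{ik} = \sum_{r,s=1}^{n} a_{rs} \frac{\partial x_r}{\partial \bar x_i} \frac{\partial x_s}{\partial \bar x_k}. \tag{21'}$$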


13. More complex laws of transformation. Scope of the 
Absolute Differential Calculus. 

In a generic change of variables a system, as we have said, 
is transformed in a way which depends on its definition. The 
cases so far examined have been the simplest, but others of 
considerably greater complexity may also occur; we shall now 
give an example of these. 

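We have seen that the simple system composed of the first
derivatives of an invariant function u is covariant; we now
proceed to examine the double system of the first derivatives
$\partial u_i / \partial x_j$ of a covariant simple system $u_i$. As a particular case, if the
$u_i$'s are the derivatives $\partial u / \partial x_i$ of a single function u, we cover the
case of the transformation of the second derivatives of an
invariant function.

To find the transformation formulae for this system, i.e. the
relation between the terms $\partial u_i / \partial x_j$ and the terms $\partial \bar u_i / \partial \bar x_j$, we start from
the transformation formula for the $u_i$'s:

$$\bar u_i = \sum_{k=1}^{n} u_k \frac{\partial x_k}{\partial \bar x_i}.$$

Differentiating it with respect to $\bar x_j$, and considering the $u_k$'s
on the right as functions of the x's and therefore of the $\bar x$'s, we
get

$$\frac{\partial \bar u_i}{\partial \bar x_j} = \sum_{h,k=1}^{n} \frac{\partial x_k}{\partial \bar x_i} \frac{\partial x_h}{\partial \bar x_j} \frac{\partial u_k}{\partial x_h} + \sum_{k=1}^{n} \frac{\partial^2 x_k}{\partial \bar x_i\, \partial \bar x_j}\, u_k. \tag{22}$$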

If the last sum were absent, the law of transformation would
be that of covariance. But in fact the presence of the second
derivatives of the x's with respect to the $\bar x$'s shows that the
system we are examining is neither invariant, nor covariant, nor
contravariant, nor mixed, and therefore is not a tensor; its law
of transformation is more complicated than any we have yet
examined. A similar result is true more generally for the system
composed of the derivatives of any tensor.
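
A simple instance shows the failure plainly. The sketch below is an illustration under assumptions not found in the text: Cartesian old variables, polar new variables ($x_1 = r\cos\theta$, $x_2 = r\sin\theta$), and the invariant $u = x_1$, whose second derivatives all vanish in the old variables. If these derivatives formed a tensor they would vanish in the new variables also; the additional sum involving the second derivatives of the x's prevents this. All names are hypothetical.

```python
import math

def jacobian(r, theta):
    """J[k][i] = dx_k / dxbar_i for x_1 = r cos(theta), x_2 = r sin(theta)."""
    return [[math.cos(theta), -r * math.sin(theta)],
            [math.sin(theta),  r * math.cos(theta)]]

def second_derivatives(r, theta):
    """H[k][i][j] = d^2 x_k / dxbar_i dxbar_j (index 0 is r, index 1 is theta)."""
    s, c = math.sin(theta), math.cos(theta)
    return [[[0.0, -s], [-s, -r * c]],
            [[0.0,  c], [ c, -r * s]]]

def transformed_derivatives(u, du, r, theta):
    """The two sums in the law for derivatives of a covariant simple system."""
    J, H = jacobian(r, theta), second_derivatives(r, theta)
    tensor_part = [[sum(J[k][i] * J[h][j] * du[k][h]
                        for h in range(2) for k in range(2))
                    for j in range(2)] for i in range(2)]
    extra_part = [[sum(H[k][i][j] * u[k] for k in range(2))
                   for j in range(2)] for i in range(2)]
    return tensor_part, extra_part

# Assumed invariant u = x_1: u_k = (1, 0), and every du_k/dx_h is zero.
u = [1.0, 0.0]
du = [[0.0, 0.0], [0.0, 0.0]]
r, theta = 1.5, 0.4
tensor_part, extra_part = transformed_derivatives(u, du, r, theta)

# Direct computation: ubar = r cos(theta), so ubar_i = (cos(theta), -r sin(theta)).
direct = [[0.0, -math.sin(theta)],
          [-math.sin(theta), -r * math.cos(theta)]]

print(tensor_part)   # all zero: the covariant-looking sum contributes nothing
print(extra_part)    # not zero, and equal to the direct derivatives
```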

It is often necessary to consider the derivatives with respect
to the independent variables of the elements of a tensor, covariant,
contravariant, or mixed. In order to avoid the complication
just observed, it is therefore convenient to replace these
derivatives by linear combinations of them with the elements of the
tensor, so chosen that those terms which lead to the aforesaid
complication disappear in the transformation formulae. This is
the problem which the Absolute Differential Calculus proposes
to solve; it does so, as we shall see farther on, by introducing
an auxiliary element, namely, an invariant differential quadratic
form. We shall therefore devote the next chapter to the study
of this important element.


CHAPTER V 

Geometrical Introduction to the Theory of 
Differential Quadratic Forms 

(a) The Line Element on a Surface 

1. Parametric equations of a surface. 

The meaning of the term "parametric equations of a surface"
is known from analytical geometry. We propose, however,
here to examine the idea from the beginning, in order to find the
formulae in the shape which is best suited to our purpose.

We shall use the letters $y_1, y_2, y_3$ throughout this chapter
to represent the Cartesian co-ordinates of the points of space
referred to three orthogonal axes. Now consider a surface, or
more generally a piece of a surface $\sigma$, to which alone the following
remarks are understood to apply, and suppose that there has
been established, in any way whatever, a one-to-one
correspondence between the points of $\sigma$ and the pairs of values which can
be assigned to two parameters $x_1, x_2$ within a certain field C
of a plane representative of the arguments $x_1, x_2$ (cf. the general
remarks in Chapter I, § 1).

This implies that the points of $\sigma$ and with them their Cartesian
co-ordinates are definite (and finite) functions of $x_1, x_2$ in the
field C. We shall accordingly write

$$y_\nu = y_\nu(x_1, x_2) \qquad (\nu = 1, 2, 3), \tag{1}$$

where for subsequent purposes the three functions must have
derivatives, to any order which we may have occasion to consider,
which are continuous in the field C.

But this behaviour of the functions is not in itself sufficient 
to ensure that the equations (1) do effectively define a surface, 
i.e. that the supposed one-to-one correspondence does in fact 
exist between C and the points of a portion of a two-dimensional 
manifold. 

It might for instance happen that only the sum $x_1 + x_2$
appeared in the equations (1), in which case the dependence on
two parameters would be only apparent, only one of them being
essential. In this case the equations (1) would define a piece
of a curve. To exclude the possibility of anything of this kind
we shall suppose that two of the equations (1) are soluble (within
C) for $x_1, x_2$, so that by solving them, and substituting the
values so found in the remaining equation, we can get one (and
only one) relation between $y_1, y_2, y_3$, i.e. the equation of a surface.

This is equivalent to imposing the condition that the
characteristic of the functional matrix^ of the equations (1) is 2.
Then the equations (1) will actually represent the parametric
equations of a piece of a surface $\sigma$; and it could be shown that
(with the restriction, if necessary, of the field C to a convenient
portion $\Gamma$ of itself, around an arbitrarily chosen point) the
portion of surface so defined is such that to any point on it there
corresponds one and only one set of values of the parameters
in the field $\Gamma$. Accordingly, with this qualitative restriction as
to the field (which we shall always consider as being of the
type $\Gamma$) in which the parameters $x_1, x_2$ are made to vary, we
are quite justified in calling $x_1, x_2$ curvilinear co-ordinates on
the surface $\sigma$ defined by equations (1).

^ §§ 6, 7, pp. 8-12.

Giving $x_1$ a constant value, and making $x_2$ vary, we get all
the points of a line, which we shall call the line $x_1$ = constant,
or the line $x_2$, or more shortly, the line 2 (since only $x_2$ varies along
it); in the same way we can define the lines $x_2$ = constant, or
the lines $x_1$, or merely the lines 1, as those along which only $x_1$
varies. We can thus think of our surface (or portion of surface)
$\sigma$ as covered by a double network of lines (co-ordinate lines)
such that two and only two (one line $x_1$ and one line $x_2$) pass
through every point of it.


2. Expression for $ds^2$.

We shall now fix two infinitely near points, $P$, $P'$, on $\sigma$;
let their curvilinear co-ordinates be

$$x_i, \qquad x_i + dx_i \qquad (i = 1, 2),$$

and, subject to the equations (1), let

$$y_\nu, \qquad y_\nu + dy_\nu \qquad (\nu = 1, 2, 3)$$

be their Cartesian co-ordinates.

Note that in order to specify a point $P$ on $\sigma$, we may take
arbitrarily (within $\Gamma$) the two co-ordinates $x_1$, $x_2$, and so also,
in order to reach $P'$, the two increments $dx_1$, $dx_2$.

The $y$'s are defined by the equations (1), so that their
differentials are connected with the $dx$'s by the equations

$$dy_\nu = \sum_{i=1}^{2} \frac{\partial y_\nu}{\partial x_i}\, dx_i \qquad (\nu = 1, 2, 3), \tag{2}$$

which are obtained by differentiating the equations (1).

We shall calculate the distance $PP' = ds$, or rather, as
being more direct, its square

$$ds^2 = \sum_{\nu=1}^{3} dy_\nu^2 .$$

Substituting the expressions (2) for the $dy$'s, we shall have

$$ds^2 = \sum_{i,k=1}^{2} dx_i\, dx_k \sum_{\nu=1}^{3} \frac{\partial y_\nu}{\partial x_i}\,\frac{\partial y_\nu}{\partial x_k},$$

from which, putting

$$a_{ik} = \sum_{\nu=1}^{3} \frac{\partial y_\nu}{\partial x_i}\,\frac{\partial y_\nu}{\partial x_k} \tag{3}$$

(by which we define a very important symmetrical double system
of regular functions of the $x$'s), we get

$$ds^2 = \sum_{i,k=1}^{2} a_{ik}\, dx_i\, dx_k . \tag{4}$$

This quadratic form, which, as we shall see, is fundamental
for the study of the metrical properties of our surface, is an
obvious generalization of the expression

$$ds^2 = dx^2 + dy^2$$

which in Cartesian co-ordinates gives the square of the distance
between two infinitely near points of a plane.
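
As a numerical illustration of formulae (3) and (4), the following Python sketch estimates the coefficients $a_{ik}$ by central differences. The choice of surface (a sphere of radius $r$, with $x_1$ the colatitude and $x_2$ the longitude), the step `h`, and all function names are assumptions made for the example; they do not come from the text.

```python
import math

def sphere(x1, x2, r=1.0):
    """Assumed instance of equations (1): x1 = colatitude, x2 = longitude."""
    return (r * math.sin(x1) * math.cos(x2),
            r * math.sin(x1) * math.sin(x2),
            r * math.cos(x1))

def partials(surface, x1, x2, h=1e-6):
    """Central-difference estimates of dy_nu/dx_1 and dy_nu/dx_2 (nu = 1, 2, 3)."""
    p1 = [(u - v) / (2 * h) for u, v in zip(surface(x1 + h, x2), surface(x1 - h, x2))]
    p2 = [(u - v) / (2 * h) for u, v in zip(surface(x1, x2 + h), surface(x1, x2 - h))]
    return p1, p2

def metric(surface, x1, x2):
    """Coefficients a_ik of formula (3), as a 2x2 nested list."""
    d = partials(surface, x1, x2)
    return [[sum(u * v for u, v in zip(d[i], d[k])) for k in range(2)]
            for i in range(2)]

def ds2(a, dx):
    """Quadratic form (4) for the increments dx = (dx_1, dx_2)."""
    return sum(a[i][k] * dx[i] * dx[k] for i in range(2) for k in range(2))

a = metric(sphere, 1.0, 0.3)
```

For the unit sphere one expects $a_{11} = 1$, $a_{12} = 0$, $a_{22} = \sin^2 x_1$, so that (4) reduces to $ds^2 = dx_1^2 + \sin^2 x_1\, dx_2^2$.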

We shall now show that the form (4) is definite and positive,
i.e. that it never becomes either zero or negative, whatever
values (real and not all zero) are assigned to the $dx$'s. That it
cannot be negative is at once seen from the fact that it is
the sum of the squares of the $dy$'s, which are always real if
the $dx$'s are real. It could therefore vanish only if all the
$dy$'s vanished, and we shall show that this is impossible for any
actual displacement (one in which $dx_1$ and $dx_2$ are not both
zero).

In fact, let us try to suppose that we can have

$$dy_1 = dy_2 = dy_3 = 0 .$$

Using equations (2), these become three linear homogeneous
equations in $dx_1$, $dx_2$. In order that any two of these may be
satisfied by non-zero values of these variables, the corresponding
determinant must vanish; since we may choose arbitrarily the
pair of equations to be satisfied, we conclude that all three
of the functional determinants (of the second order) of the
$y$'s with respect to the $x$'s must vanish, which contradicts
the hypothesis that the characteristic of the functional matrix
is 2.

The general theorem relating to simultaneous linear
homogeneous equations could also be applied directly; namely, that
the number of independent solutions is the difference between
the number of the unknowns and the characteristic of the matrix
of the coefficients: in our case $2 - 2$, or 0.

From the proof that the quadratic form under discussion is
definite, it follows, by a known theorem¹ on quadratic forms,
that the determinant

$$a = \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix}$$

composed of the coefficients of $ds^2$ (called the discriminant of
the form) is not zero; in particular, when, as in the present case,
the form is positive as well as definite, we have specifically $a > 0$.
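
The link between the hypothesis on the functional matrix and the inequality $a > 0$ can be checked numerically through Lagrange's identity: the discriminant $a$ equals the sum of the squares of the three second-order functional determinants. The identity is standard but is not stated in the text, and the numerical values of the partial derivatives below are invented for illustration.

```python
def minors(p1, p2):
    """The three second-order determinants formed from the columns dy/dx_1, dy/dx_2."""
    return (p1[1] * p2[2] - p1[2] * p2[1],
            p1[2] * p2[0] - p1[0] * p2[2],
            p1[0] * p2[1] - p1[1] * p2[0])

def discriminant(p1, p2):
    """a = a11 * a22 - a12^2, with the a_ik built as in formula (3)."""
    dot = lambda u, v: sum(s * t for s, t in zip(u, v))
    a11, a12, a22 = dot(p1, p1), dot(p1, p2), dot(p2, p2)
    return a11 * a22 - a12 * a12

p1 = (1.0, 0.0, 2.0)    # hypothetical values of dy_nu/dx_1
p2 = (0.0, 1.0, -1.0)   # hypothetical values of dy_nu/dx_2

a = discriminant(p1, p2)
s = sum(m * m for m in minors(p1, p2))
```

When the second column is proportional to the first (characteristic 1), all three minors vanish and the same function returns $a = 0$.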

The fundamental form (4) calls for one last remark, almost
obvious but important. This is that the system of the coefficients
$a_{ik}$ is covariant with respect to any transformations whatever of the
variables $x_1$, $x_2$ (which justifies our having placed the indices
$i$, $k$ below). This covariance follows directly (applying a remark
made at the foot of p. 84) from the invariance of the quadratic
form.


3. Determination of the directions drawn from a generic 
point. 

In the space $y_1$, $y_2$, $y_3$, a direction drawn from a generic point
$P$ may be considered as determined by an infinitesimal segment
having one end at $P$, or if preferred by another point $P'$ infinitely
near $P$, or, which comes to the same thing, by an infinitesimal
displacement of $P$.

¹ The theorem referred to is as follows. Let

$$\phi = \sum_{i,k=1}^{n} a_{ik}\, x_i\, x_k$$

be a definite quadratic form in $n$ variables; we shall show that its discriminant $a$
cannot be zero.

In fact, putting

$$y_i = \sum_{k=1}^{n} a_{ik}\, x_k \qquad (i = 1, 2, \dots, n),$$

we get

$$\phi = \sum_{i=1}^{n} x_i\, y_i .$$

Now if $a = 0$, we could make $\phi = 0$ without all the $x$'s being zero (contrary
to the hypothesis that the form is definite); we should only have to make all the
$y$'s zero, by solving the $n$ linear homogeneous equations

$$\sum_{k=1}^{n} a_{ik}\, x_k = 0 \qquad (i = 1, 2, \dots, n),$$

which would be soluble, giving values for the $x$'s which are not all zero, provided
$a = 0$.

For definite positive forms $a$ is also necessarily positive. One way of
seeing this is to apply one of the infinite number of (real) linear substitutions
which reduce $\phi$ to the canonical form (see e.g. Bianchi: Lezioni di geometria
analitica, Appendix, pp. 571-592; Pisa, Spoerri, 1920). It is obvious that a
positive form which contains only squares of the variables has its discriminant
$a' > 0$. But the original $a$ and $a'$ are connected by the relation $a' = a\,\Delta^2$, where
$\Delta$ denotes the determinant of the linear substitution. (See p. 157.) We
therefore necessarily also have $a > 0$. Q.E.D.

Now suppose that $P$ belongs to $\sigma$, and consider the directions
drawn from $P$ which are tangent to the surface. To determine
them we have to take points $P'$, infinitely near $P$ and belonging
to $\sigma$. If therefore we call the surface co-ordinates of $P$ $x_1$ and
$x_2$, we can determine $P'$ by the surface co-ordinates $x_1 + dx_1$,
$x_2 + dx_2$.

Thus to each pair of infinitesimals $dx_1$, $dx_2$, there corresponds
one and only one tangential direction drawn from $P$. To one
direction, on the other hand, there correspond an infinite number
of pairs of differentials which differ from each other by a (positive)
factor, since the length $ds$ of the segment $PP'$ chosen to
determine the direction is a priori arbitrary, the only condition being
that it is infinitesimal.

To make the correspondence one-to-one, we shall determine a
direction not by the differentials themselves but by the
proportional quantities

$$\lambda^1 = \frac{dx_1}{ds}, \qquad \lambda^2 = \frac{dx_2}{ds};$$

these are unchanged if we multiply $dx_1$ and $dx_2$ by a positive
factor $k$, since it follows from equation (4) that then $ds$ is also
multiplied by $k$.

These quantities are called parameters of the direction and
obviously reduce to direction cosines when the surface $\sigma$ is a
plane and $x_1$, $x_2$ represent orthogonal Cartesian co-ordinates.
The parameters are not independent but are connected by the
relation

$$\sum_{i,k=1}^{2} a_{ik}\, \lambda^i \lambda^k = 1, \tag{5}$$

which is obtained by dividing equation (4) by $ds^2$ and which
corresponds to the well-known identity for the Euclidean plane
that the sum of the squares of the cosines is 1. Since $ds$ is an
invariant and the $dx_i$ are contravariants, the parameters are
also contravariants, which justifies our having placed the indices
above.

Instead of the parameters two linear combinations of them
are sometimes used; these are

$$\lambda_i = \sum_{k=1}^{2} a_{ik}\, \lambda^k \qquad (i = 1, 2), \tag{6}$$

which are called moments. Since the coefficients $a_{ik}$ form a
covariant double system (cf. § 2), and the parameters, as we have
just shown, form a contravariant simple system, it follows that
the moments are covariants.¹

We showed in § 2 that the determinant $a$ is not zero; the
equations (6) can therefore be solved, giving the formulae

$$\lambda^i = \sum_{k=1}^{2} a^{ik}\, \lambda_k \qquad (i = 1, 2), \tag{6'}$$

which give the parameters in terms of the moments (the $a^{ik}$
being the elements reciprocal to the $a_{ik}$). The parameters
and moments are connected by a particularly simple and
remarkable bilinear relation, which follows immediately from
(5) and (6). In fact, multiplying the equation (6) for the generic
index $i$ by $\lambda^i$, and summing for $i = 1$ and $i = 2$, we get from (5)

$$\sum_{i=1}^{2} \lambda_i\, \lambda^i = 1 . \tag{5'}$$

It follows directly that the moments also are connected by a
quadratic relation. We need only substitute in (5') for $\lambda^i$ the
expression given by formula (6'), which gives at once

$$\sum_{i,k=1}^{2} a^{ik}\, \lambda_i \lambda_k = 1 . \tag{5''}$$
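
The relations between parameters and moments can be verified on a small numerical case. The sketch below takes an invented set of coefficients $a_{ik}$ at one point (any symmetric system with $a_{11} > 0$ and $a > 0$ would serve), forms the parameters of the direction $dx_1 : dx_2 = 1 : 2$, and evaluates the left-hand sides of (5), (5') and (5''). The function names are illustrative only.

```python
import math

a = [[2.0, 1.0], [1.0, 3.0]]     # hypothetical coefficients a_ik at a point (a = 5 > 0)

def reciprocal(a):
    """Reciprocal elements a^ik of a symmetric 2x2 system (cofactors divided by a)."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

def parameters(a, dx):
    """lambda^i = dx_i / ds, with ds taken from the quadratic form (4)."""
    ds = math.sqrt(sum(a[i][k] * dx[i] * dx[k] for i in range(2) for k in range(2)))
    return [dx[0] / ds, dx[1] / ds]

def compound(m, v):
    """Sum over k of m[i][k] * v[k]: gives (6) with m = a_ik and (6') with m = a^ik."""
    return [sum(m[i][k] * v[k] for k in range(2)) for i in range(2)]

a_up = reciprocal(a)
lam_up = parameters(a, (1.0, 2.0))     # parameters of the chosen direction
lam_dn = compound(a, lam_up)           # moments, formula (6)

rel5 = sum(a[i][k] * lam_up[i] * lam_up[k] for i in range(2) for k in range(2))
rel5p = sum(lam_dn[i] * lam_up[i] for i in range(2))
rel5pp = sum(a_up[i][k] * lam_dn[i] * lam_dn[k] for i in range(2) for k in range(2))
```

All three quantities should equal 1 to rounding error, and `compound(a_up, lam_dn)` should reproduce `lam_up`, in agreement with (6').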

4. Angle between two directions. Contravariance of the
coefficients $a^{ik}$.

Consider two directions on a surface drawn from a single
point $P$. We shall denote them by $\boldsymbol{\lambda}$ and $\boldsymbol{\mu}$, where these two
symbols mean more precisely the two unit vectors which
determine the given directions. The parameters and the moments
of $\boldsymbol{\lambda}$ will be denoted by $\lambda^i$, $\lambda_i$, and those of $\boldsymbol{\mu}$ by $\mu^i$, $\mu_i$, respectively.
We propose to find the angle $\vartheta$ between the two directions as a
function of the parameters or of the moments.

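¹ Cf. § 10, p. 70.

Denoting the increments of the co-ordinates by $dy_\nu$,
$dx_i$ respectively, the direction cosines of $\boldsymbol{\lambda}$, for a displacement
$ds$ along $\boldsymbol{\lambda}$, will be

$$\frac{dy_\nu}{ds} = \sum_{i=1}^{2} \frac{\partial y_\nu}{\partial x_i}\,\frac{dx_i}{ds} = \sum_{i=1}^{2} \frac{\partial y_\nu}{\partial x_i}\, \lambda^i \qquad (\nu = 1, 2, 3). \tag{7}$$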

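Similarly, denoting by the symbol $\delta$ the increments of the
co-ordinates for a displacement $\delta s$ along $\boldsymbol{\mu}$, we have for the direction
cosines of this direction

$$\frac{\delta y_\nu}{\delta s} = \sum_{k=1}^{2} \frac{\partial y_\nu}{\partial x_k}\,\frac{\delta x_k}{\delta s} = \sum_{k=1}^{2} \frac{\partial y_\nu}{\partial x_k}\, \mu^k \qquad (\nu = 1, 2, 3). \tag{7'}$$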

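Hence, from the usual formulae of analytical geometry, we
get

$$\cos\vartheta = \sum_{\nu=1}^{3} \frac{dy_\nu}{ds}\,\frac{\delta y_\nu}{\delta s} = \sum_{i,k=1}^{2} \lambda^i \mu^k \sum_{\nu=1}^{3} \frac{\partial y_\nu}{\partial x_i}\,\frac{\partial y_\nu}{\partial x_k},$$

and therefore finally

$$\cos\vartheta = \sum_{i,k=1}^{2} a_{ik}\, \lambda^i \mu^k . \tag{8}$$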

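Substituting for $\mu^k$, or $\lambda^i$, or both, their expressions in terms
of the moments, we get for $\cos\vartheta$ the following equivalent
expressions:

$$\cos\vartheta = \sum_{i=1}^{2} \lambda^i \mu_i , \tag{8'}$$

$$\cos\vartheta = \sum_{i=1}^{2} \lambda_i \mu^i , \tag{8''}$$

$$\cos\vartheta = \sum_{i,k=1}^{2} a^{ik}\, \lambda_i \mu_k . \tag{8'''}$$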

The last of these formulae enables us to see that the notation
$a^{ik}$ is in agreement not only with the convention adopted for
reciprocal elements, but also with that of writing the indices of
contravariance above. For putting

$$u_i = \lambda_i\, ds, \qquad v_k = \mu_k\, \delta s \qquad (i, k = 1, 2),$$

where we note that the $u$'s and the $v$'s are independent variables
(not connected by any relation as are the $\lambda$'s and the $\mu$'s), we can
write equation (8''') in the form

$$ds\, \delta s\, \cos\vartheta = \sum_{i,k=1}^{2} a^{ik}\, u_i v_k ;$$

since the left-hand side is invariant, and the right-hand side
consists of a bilinear form in two sets of arbitrary covariant variables,
it follows that the coefficients $a^{ik}$ are contravariant.

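To find $\sin\vartheta$, we can form the product by rows of the two
determinants

$$\begin{vmatrix} \lambda^1 & \lambda^2 \\ \mu^1 & \mu^2 \end{vmatrix} \times \begin{vmatrix} \lambda_1 & \lambda_2 \\ \mu_1 & \mu_2 \end{vmatrix} .$$

Applying formulae (5'), (8'), and (8''), this becomes

$$\begin{vmatrix} 1 & \cos\vartheta \\ \cos\vartheta & 1 \end{vmatrix} = 1 - \cos^2\vartheta = \sin^2\vartheta .$$

We therefore have

$$\sin\vartheta = \sqrt{\begin{vmatrix} \lambda^1 & \lambda^2 \\ \mu^1 & \mu^2 \end{vmatrix} \times \begin{vmatrix} \lambda_1 & \lambda_2 \\ \mu_1 & \mu_2 \end{vmatrix}} , \tag{9}$$

where the radical must have the sign +, since by definition the
angle between two directions is always $\leq \pi$, and therefore
$\sin\vartheta \geq 0$.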

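The expression (9) can be put in another form. It is easy to
verify that

$$\begin{vmatrix} \lambda_1 & \lambda_2 \\ \mu_1 & \mu_2 \end{vmatrix} = \begin{vmatrix} \lambda^1 & \lambda^2 \\ \mu^1 & \mu^2 \end{vmatrix} \cdot \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix} = a \begin{vmatrix} \lambda^1 & \lambda^2 \\ \mu^1 & \mu^2 \end{vmatrix} ,$$

and therefore

$$\sin\vartheta = \sqrt{a}\; \left|\, \det \begin{pmatrix} \lambda^1 & \lambda^2 \\ \mu^1 & \mu^2 \end{pmatrix} \right| , \tag{9'}$$

or also

$$\sin\vartheta = \frac{1}{\sqrt{a}}\; \left|\, \det \begin{pmatrix} \lambda_1 & \lambda_2 \\ \mu_1 & \mu_2 \end{pmatrix} \right| , \tag{9''}$$

where in each case it is understood that the radical $\sqrt{a}$ is to have
its arithmetical value, and that the determinant is taken in absolute
value, so that $\sin\vartheta \geq 0$.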
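
The three expressions (9), (9') and (9'') can be compared numerically. The sketch below again assumes a plane in oblique co-ordinates with axes at 60° (an example chosen here, not in the text) and the two co-ordinate directions, for which $\sin\vartheta$ should equal $\sqrt{3}/2$.

```python
import math

a = [[1.0, 0.5], [0.5, 1.0]]               # assumed a_ik: plane, oblique axes at 60 degrees
det_a = a[0][0] * a[1][1] - a[0][1] * a[1][0]

lam_up, mu_up = (1.0, 0.0), (0.0, 1.0)     # parameters of the two co-ordinate directions
lam_dn = [sum(a[i][k] * lam_up[k] for k in range(2)) for i in range(2)]   # moments
mu_dn = [sum(a[i][k] * mu_up[k] for k in range(2)) for i in range(2)]

det_up = lam_up[0] * mu_up[1] - lam_up[1] * mu_up[0]   # determinant of the parameters
det_dn = lam_dn[0] * mu_dn[1] - lam_dn[1] * mu_dn[0]   # determinant of the moments

sin_9 = math.sqrt(det_up * det_dn)                     # formula (9)
sin_9p = math.sqrt(det_a) * abs(det_up)                # formula (9')
sin_9pp = abs(det_dn) / math.sqrt(det_a)               # formula (9'')
cos_theta = sum(lam_up[i] * mu_dn[i] for i in range(2))
```

Interchanging the two directions changes the sign of both determinants but leaves the three values unaltered, which is why the absolute value is taken.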
5. Associated, and in particular reciprocal, tensors. The
typical example of the parameters and moments of a single
direction.

Take a generic tensor (with reference to the variables $x_1$, $x_2$)

$$A^{h_1 h_2 \dots h_\mu}_{i_1 i_2 \dots i_m}$$

of rank $m + \mu$; if we compound it with the coefficients $a_{ik}$
of our expression $ds^2$, we can transfer any one of the indices $h$,
say $h_1$, from above to below, so getting

$$B^{h_2 \dots h_\mu}_{k\, i_1 i_2 \dots i_m} = \sum_{h_1=1}^{2} a_{k h_1}\, A^{h_1 h_2 \dots h_\mu}_{i_1 i_2 \dots i_m} ,$$

which is a tensor of the same rank, but with an index of
covariance more and an index of contravariance less, namely $k$ in
place of $h_1$. Similarly, compounding with the contravariant double
system composed of the reciprocal elements $a^{ik}$ (cf. § 4), we can transfer
any one of the indices of covariance, say $i_1$, from below to above.
We need only put

$$C^{k\, h_1 h_2 \dots h_\mu}_{i_2 \dots i_m} = \sum_{i_1=1}^{2} a^{k i_1}\, A^{h_1 h_2 \dots h_\mu}_{i_1 i_2 \dots i_m} ,$$

in which the system $C$ is also a tensor of the same rank.

These operations can obviously be repeated, so as to transfer
not one but several or even all of the indices of the given tensor.
All the tensors so obtained are called associated tensors of the
tensor $A$; note that association is a relation which is
dependent on a given $ds^2$. In particular the tensor

$$Z^{i_1 i_2 \dots i_m}_{h_1 h_2 \dots h_\mu} = \sum_{j_1 \dots j_\mu,\; l_1 \dots l_m} a_{h_1 j_1} \cdots a_{h_\mu j_\mu}\; a^{i_1 l_1} \cdots a^{i_m l_m}\; A^{j_1 j_2 \dots j_\mu}_{l_1 l_2 \dots l_m} ,$$

in which the indices of covariance are the same as the indices of
contravariance in $A$, and vice versa, is said to be reciprocal to
$A$, the use of the term being justified by the consideration that
the relation is reversible, $A$ being the reciprocal of $Z$ in the same
sense as $Z$ is of $A$. This can be shown explicitly if we suppose
the above formulae defining the system $Z$ solved for the $A$'s.

Equations (6) and (6') show that the parameters and moments
of a single direction form a particularly simple and striking
example of a pair of reciprocal systems.
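
The operations of this section can be tried out on the simplest mixed case, a tensor $A^h_i$ with one index of each kind in two variables. In the sketch below the coefficients $a_{ik}$, the components of $A$ and the function names are all invented for the example; the storage convention `A[h][i]` (upper index first) is likewise an assumption of the sketch.

```python
a = [[2.0, 1.0], [1.0, 3.0]]          # hypothetical coefficients a_ik (a = 5)
a_up = [[0.6, -0.2], [-0.2, 0.4]]     # their reciprocal elements a^ik

A = [[1.0, 4.0], [-2.0, 0.5]]         # hypothetical mixed tensor, A[h][i] = A^h_i

def lower_first(a, A):
    """B_{k i} = sum_h a_{k h} A^h_i : the upper index is brought down."""
    return [[sum(a[k][h] * A[h][i] for h in range(2)) for i in range(2)]
            for k in range(2)]

def raise_first(a_up, B):
    """sum_k a^{h k} B_{k i} : the first lower index is brought up again."""
    return [[sum(a_up[h][k] * B[k][i] for k in range(2)) for i in range(2)]
            for h in range(2)]

def reciprocal_tensor(a, a_up, A):
    """Z^i_h = sum_{j,l} a_{h j} a^{i l} A^j_l, stored as Z[i][h]."""
    return [[sum(a[h][j] * a_up[i][l] * A[j][l] for j in range(2) for l in range(2))
             for h in range(2)] for i in range(2)]

B = lower_first(a, A)
restored = raise_first(a_up, B)             # should reproduce A
Z = reciprocal_tensor(a, a_up, A)
back = reciprocal_tensor(a, a_up, Z)        # reciprocal of the reciprocal
```

That `back` agrees with `A` illustrates the reversibility which justifies the term reciprocal.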

Remark I. The definition of the tensors associated with a
generic tensor $A$ involves essentially a specific $ds^2$, whose
coefficients $a_{ik}$ and their reciprocals $a^{ik}$ form part of the definition.
When it is necessary to emphasize this fact, we can do so by
speaking of the tensor or tensors as associated with respect to the
$ds^2$ in question.

Remark II. For the symmetrical covariant double system
$a_{ik}$ and the contravariant system composed of the reciprocal
elements $a^{ik}$, from which the associated tensors are constructed
by composition, we could plainly take the coefficients of any
other invariant quadratic form $\phi$ instead of those of $ds^2$ (provided
only that $\phi$ is irreducible, so that the reciprocal does in fact
exist). We should then have associated systems with respect
to the quadric $\phi$.

Remark III. We may point out at this stage that the
idea of associated systems holds good as it stands for any
number of variables $x_1$, $x_2$, ..., $x_n$. We need only suppose that
the indices take the values 1, 2, ..., $n$, and that the auxiliary
element is represented by an irreducible differential quadratic
form $\phi = \sum_{i,k=1}^{n} a_{ik}\, dx_i\, dx_k$ in $n$ instead of in two variables.

6. Surface vectors. 

Let $\mathbf{R}$ be a non-zero vector drawn from a point $P$ of the surface
$\sigma$, tangentially to the surface; we shall call it a surface or
tangential vector, and we can determine it by its Cartesian components
$Y_\nu$ ($\nu = 1, 2, 3$) or, in closer agreement with its intimate
relationship to the surface, by its magnitude $R$ and its direction, the
latter being determined by its parameters $\lambda^i$ or its moments
$\lambda_i$. These three quantities are not independent, since the
parameters (or moments) are connected by the usual identity; the
vector is therefore determined by two essential quantities. It
will accordingly be convenient to represent it by the two
independent quantities

$$R^i = R\,\lambda^i \qquad (i = 1, 2), \tag{10}$$

or alternatively by the pair

$$R_i = R\,\lambda_i \qquad (i = 1, 2), \tag{10'}$$

which are called respectively the contravariant and covariant
components of the vector.

These obviously form a pair of reciprocal systems, since by
the preceding section the parameters and moments are reciprocal,
and equations (10) and (10') show that $R^i$ and $R_i$ differ from the
parameters and moments only by a common factor $R$.

$R$ can be calculated from them by means of the identities

$$\sum_{i,k=1}^{2} a_{ik}\, R^i R^k = R^2 , \tag{11}$$

$$\sum_{i=1}^{2} R_i\, R^i = R^2 , \tag{11'}$$

$$\sum_{i,k=1}^{2} a^{ik}\, R_i R_k = R^2 \tag{11''}$$

(which are merely (5), (5'), and (5''), each multiplied by $R^2$),
and then $\lambda^i$ and $\lambda_i$ follow from equations (10) and (10'); thus
we see that the vector is completely determined by its
contravariant (or by its covariant) components.

To find the relation between the contravariant components
$R^i$ and the components $Y_\nu$ with respect to Cartesian axes $y_1$, $y_2$, $y_3$,
note that the direction cosines of the direction whose
parameters are the $\lambda$'s are given by equation (7), and hence the
components $Y_\nu$ (which are equal to these cosines each multiplied
by $R$) are given by the equation

$$Y_\nu = \sum_{i=1}^{2} \frac{\partial y_\nu}{\partial x_i}\, R^i \qquad (\nu = 1, 2, 3). \tag{12}$$


It is now obvious that the covariant components can be
obtained from the contravariant components, and vice versa,
by means of formulae completely analogous to (6) and (6'), and
obtained from these by multiplying them by $R$.

If we have to deal with zero vectors, i.e. vectors having their length
$R$ zero and their direction indeterminate, we find that in order
to satisfy equations (10) and (10') in this limiting case we have
to take $R^i = 0$, $R_i = 0$. With these values all the other
equations ((11), (11'), &c.) are also satisfied, as can at once
be seen from the fact that both sides of each equation vanish
separately.

By an analogous procedure we can find simple expressions
for the scalar product $\mathbf{R} \times \mathbf{V}$ of two surface vectors $\mathbf{R}$, $\mathbf{V}$,
remembering that if $\vartheta$ is the angle between the vectors we have

$$\mathbf{R} \times \mathbf{V} = R\,V \cos\vartheta . \tag{13}$$

In fact, considering first the general case of two vectors neither
of which is zero and whose versors are $\boldsymbol{\lambda}$ and $\boldsymbol{\mu}$ respectively, and
multiplying equation (8') by $RV$, we get

$$\mathbf{R} \times \mathbf{V} = \sum_{i=1}^{2} R^i\, V_i , \tag{14}$$

while the equations (8), (8''), and (8''') would give analogous
formulae.

The expression (14) for the scalar product also holds, like
formula (13), when one or both vectors are zero, the scalar
product (by definition) and the right-hand side being then zero in
both formulae.
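
Formulae (12) and (14) can be checked against ordinary Cartesian vector algebra. The sketch below assumes a plane carrying oblique co-ordinates, so that the partial derivatives $\partial y_\nu / \partial x_i$ are the constant vectors `e1`, `e2`; the two vectors are given by invented contravariant components. None of these particular values comes from the text.

```python
import math

e1 = (1.0, 0.0, 0.0)                       # dy_nu/dx_1 for the assumed oblique plane
e2 = (0.5, math.sqrt(3.0) / 2.0, 0.0)      # dy_nu/dx_2 (axes at 60 degrees)
E = (e1, e2)

# Coefficients a_ik built from the partial derivatives, as in formula (3).
a = [[sum(u * v for u, v in zip(E[i], E[k])) for k in range(2)] for i in range(2)]

def covariant(a, up):
    """Covariant components R_i = sum_k a_ik R^k."""
    return [sum(a[i][k] * up[k] for k in range(2)) for i in range(2)]

def cartesian(up):
    """Cartesian components Y_nu = sum_i (dy_nu/dx_i) R^i, formula (12)."""
    return [sum(E[i][nu] * up[i] for i in range(2)) for nu in range(3)]

R_up, V_up = (2.0, 1.0), (-1.0, 3.0)       # hypothetical contravariant components
R_dn, V_dn = covariant(a, R_up), covariant(a, V_up)

R_squared = sum(R_dn[i] * R_up[i] for i in range(2))          # R^2 = sum_i R_i R^i
product_14 = sum(R_up[i] * V_dn[i] for i in range(2))         # formula (14)
product_cartesian = sum(p * q for p, q in zip(cartesian(R_up), cartesian(V_up)))
```

The scalar product found from the surface components alone agrees with the one computed in the ambient space, as it must.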


7. Parameters and moments of the co-ordinate lines. Element 
of area. 

We shall next obtain the direction parameters of a co-ordinate
line, e.g. the line $x_1$ (i.e. $x_2$ = constant), considered in the
direction of $x_1$ increasing. For an infinitesimal displacement in
this direction, we have

$$dx_2 = 0, \qquad ds^2 = a_{11}\, dx_1^2 + 2 a_{12}\, dx_1 dx_2 + a_{22}\, dx_2^2 = a_{11}\, dx_1^2 .$$

Since $ds$ is essentially positive, and $dx_1$ is positive by
hypothesis, we have, extracting the square root of the last of these
formulae,

$$ds = \sqrt{a_{11}}\; dx_1 ,$$

where the radical is taken positively. It follows that

$$\lambda^1 = \frac{dx_1}{ds} = \frac{1}{\sqrt{a_{11}}}, \qquad \lambda^2 = \frac{dx_2}{ds} = 0 . \tag{15}$$

Similarly, the parameters of the line 2, in the direction of
$x_2$ increasing, will be

$$\mu^1 = 0, \qquad \mu^2 = \frac{1}{\sqrt{a_{22}}} . \tag{15'}$$

Substituting these expressions in the equations (8) and (9'),
we can get the angle $\Omega$ between the two co-ordinate directions.
The resulting formulae are

$$\cos\Omega = \frac{a_{12}}{\sqrt{a_{11}\, a_{22}}} , \tag{16}$$

$$\sin\Omega = \frac{\sqrt{a}}{\sqrt{a_{11}\, a_{22}}} . \tag{16'}$$
Equation (16) shows that the necessary and sufficient
condition that the co-ordinates $x_1$, $x_2$ be orthogonal is $a_{12} = 0$.
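
A short sketch of formulae (16) and (16') in use. The two sets of coefficients are assumed examples: oblique axes at 60° in a plane, and the unit sphere in colatitude and longitude at the point $x_1 = 1$. The function name is illustrative.

```python
import math

def coordinate_angle(a):
    """Return (cos, sin) of the angle between the co-ordinate lines, from (16) and (16')."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]     # discriminant a
    root = math.sqrt(a[0][0] * a[1][1])
    return a[0][1] / root, math.sqrt(det) / root

oblique = [[1.0, 0.5], [0.5, 1.0]]                  # assumed: plane, axes at 60 degrees
sphere = [[1.0, 0.0], [0.0, math.sin(1.0) ** 2]]    # assumed: unit sphere at colatitude 1

cos_o, sin_o = coordinate_angle(oblique)
cos_s, sin_s = coordinate_angle(sphere)
```

On the sphere $a_{12} = 0$, so the meridians and parallels come out orthogonal, in agreement with the criterion just stated.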

If we take an infinitesimal element of surface, obtained by
drawing two infinitesimal segments $ds$, $\delta s$, from a point $P$ along
the co-ordinate lines, and completing the parallelogram, the
area of this element will be

$$d\sigma = ds\, \delta s\, \sin\Omega = \sqrt{a_{11}}\, dx_1 \cdot \sqrt{a_{22}}\, dx_2 \cdot \frac{\sqrt{a}}{\sqrt{a_{11}\, a_{22}}} = \sqrt{a}\; dx_1\, dx_2 . \tag{17}$$

8. Fundamental observation (Gauss's) on the intrinsic
geometry of a surface.

We are now in a position to make an observation which will
show fully the importance of the quadratic form (4) in the study
of the surface. For this purpose we shall first make use of certain
intuitive considerations in order to fix the idea of the intrinsic
geometry of a surface.

Let us give the concept of a surface a material form by
thinking of a flexible and inextensible sheet of matter on which figures
can be drawn, and such that it can be deformed, bent, and folded
in an infinite number of ways, but not torn or stretched. When
a surface of this kind is deformed the figures drawn on it will
take different spatial configurations, but some of their properties
will be invariant. For instance, if two lines intersect, they retain
this property however the sheet is deformed; the length of a
segment of a line remains the same, and hence the distance
between two points, measured along the surface (i.e. along the
shortest line joining them which lies wholly on the surface),
is unchanged; the angle between two lines which meet at a point
is unchanged; and so on. In short, all those properties which
involve no element alien to the surface (or, as it is usually
expressed, which can be investigated without leaving the surface)
are independent of the deformations of the surface, and
constitute its intrinsic geometry.

Even in elementary geometry we have examples of this
kind. Plane geometry can be, and most of it is, constructed
without using points outside the plane, and is therefore intrinsic
as regards its plane; it still holds (at least for suitably restricted
regions) if the plane is folded, or wrapped round a cone or a
cylinder.

Now consider the fact that the fundamental elements for the
study of the metrical properties of a figure are: (a) the distance
between two infinitely near points, and (b) the angle between
two directions. In fact, the length of any line whatever is found
by integration from the length of its infinitesimal elements, the
area of a figure can be calculated by breaking it up into
elementary parallelograms, and so on. Now the formulae (4) and (8)
(or (8'), &c.) provide us with precisely these two fundamental
elements for the study of the intrinsic geometry of a surface,
whenever the coefficients of $ds^2$ are known as functions of the
$x$'s; these coefficients therefore determine the metrical (intrinsic)
properties of the surface, and are invariant for any deformation
whatever of the surface which does not involve stretching. Hence
the particular interest of all those theorems which can be expressed
analytically in terms only of the surface co-ordinates $x$ and the
coefficients of the fundamental form; namely, the fact that
they express properties belonging to the intrinsic geometry of
the surface. The introduction into mathematics of this idea, and
the fundamental observation relating to it, are due to Karl
Friedrich Gauss.
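
The statement that lengths and areas follow from the coefficients $a_{ik}$ alone can be made concrete. The sketch below uses only the coefficients of a sphere of radius $r$ (assumed here as $a_{11} = r^2$, $a_{12} = 0$, $a_{22} = r^2 \sin^2 x_1$, with $x_1$ the colatitude), never the Cartesian equations: a length is obtained by summing $ds$ from (4) along a line, an area by summing the elements $\sqrt{a}\, dx_1\, dx_2$. Step counts and function names are arbitrary choices for the example.

```python
import math

def a_sphere(x1, x2, r=1.0):
    """Assumed coefficients a_ik of a sphere of radius r (x1 = colatitude, x2 = longitude)."""
    return [[r * r, 0.0], [0.0, (r * math.sin(x1)) ** 2]]

def length(a_of, curve, t0, t1, n=2000):
    """Length of the line x_i = curve(t), summing ds from form (4) over n small steps."""
    total = 0.0
    for j in range(n):
        xa = curve(t0 + (t1 - t0) * j / n)
        xb = curve(t0 + (t1 - t0) * (j + 1) / n)
        m = a_of((xa[0] + xb[0]) / 2, (xa[1] + xb[1]) / 2)   # coefficients at mid-step
        dx = (xb[0] - xa[0], xb[1] - xa[1])
        total += math.sqrt(sum(m[i][k] * dx[i] * dx[k]
                               for i in range(2) for k in range(2)))
    return total

def area(a_of, x1_range, x2_range, n=200):
    """Area as the sum of sqrt(a) dx1 dx2 over an n-by-n grid of elementary parallelograms."""
    (p0, p1), (q0, q1) = x1_range, x2_range
    h1, h2 = (p1 - p0) / n, (q1 - q0) / n
    total = 0.0
    for i in range(n):
        for k in range(n):
            m = a_of(p0 + (i + 0.5) * h1, q0 + (k + 0.5) * h2)
            total += math.sqrt(m[0][0] * m[1][1] - m[0][1] * m[1][0]) * h1 * h2
    return total

parallel = length(a_sphere, lambda t: (math.pi / 3, t), 0.0, 2 * math.pi)
whole = area(a_sphere, (0.0, math.pi), (0.0, 2 * math.pi))
```

The first result should approximate the circumference $2\pi \sin(\pi/3)$ of the parallel of colatitude $\pi/3$, the second the total area $4\pi$ of the unit sphere.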

9. Note on developable surfaces. 

A developable surface is one which is flexible and inextensible
and can be made to coincide with a region of a plane, without
tearing or overlapping. Examples are the cylinder and the cone,
and any surface formed of several portions of a plane. The
intrinsic geometry of surfaces of this kind, as we have seen in the
preceding section, is identical with that of the plane, and their
line element can take the same forms as that of the plane; e.g.
we can choose a system of surface co-ordinates $x_1$, $x_2$, such that

$$ds^2 = dx_1^2 + dx_2^2 .$$
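
As a check on the simplest developable, the sketch below compares a plane with a circular cylinder on which $x_1$ is the arc length round the cylinder and $x_2$ the distance along a generator. The radius, the sample point, the difference step and the function names are assumptions of the example; the coefficients $a_{ik}$ are estimated by central differences from formula (3).

```python
import math

def plane(x1, x2):
    """A plane in Cartesian co-ordinates."""
    return (x1, x2, 0.0)

def cylinder(x1, x2, r=2.0):
    """Assumed circular cylinder of radius r: x1 = arc length round it, x2 along a generator."""
    return (r * math.cos(x1 / r), r * math.sin(x1 / r), x2)

def metric(surface, x1, x2, h=1e-5):
    """Coefficients a_ik of formula (3), estimated by central differences."""
    p = ([(u - v) / (2 * h) for u, v in zip(surface(x1 + h, x2), surface(x1 - h, x2))],
         [(u - v) / (2 * h) for u, v in zip(surface(x1, x2 + h), surface(x1, x2 - h))])
    return [[sum(u * v for u, v in zip(p[i], p[k])) for k in range(2)]
            for i in range(2)]

a_plane = metric(plane, 0.7, -1.2)
a_cylinder = metric(cylinder, 0.7, -1.2)
```

Both systems of coefficients should reduce to $a_{11} = a_{22} = 1$, $a_{12} = 0$, i.e. to the line element $ds^2 = dx_1^2 + dx_2^2$ of the plane.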

Consider a simple infinity of planes, which we may think of
as represented by a linear equation in the Cartesian co-ordinates
$y_1$, $y_2$, $y_3$ whose coefficients are continuous functions of a parameter
$u$. The envelope of this family of planes is a developable surface
to which they are tangent planes. This proposition is rendered
intuitive by the following argument based on infinitesimals.

Let $\varpi_1$, $\varpi_2$, $\varpi_3$, ... be planes of the family corresponding to
successive infinitesimal increments of the parameter $u$; and let
$g_1$ be the intersection of $\varpi_1$ and $\varpi_2$, $g_2$ the intersection
of $\varpi_2$ and $\varpi_3$, and so on. By definition, the geometrical locus
of all these lines is the envelope surface. The lines $g_1$, $g_2$, ...
are called its characteristics or generators; each of the planes
$\varpi$ contains two of them, forming an infinitesimal angle (cf.
fig. 1), and the envelope may be considered as made up of an
infinite number of these infinitesimal plane regions. It is thus clear
that the envelope surface can be developed into a plane by
successive rotations about the generators $g_1$, $g_2$, ...

We shall shortly have occasion to consider the envelope of
a particular family of planes (depending on a single parameter),
namely, the tangent planes to any surface whatever $\sigma$, at all
points of a specified line $T$ lying on the surface. The envelope
of these planes is a developable surface $\sigma_T$, which is called the
developable circumscribed to $\sigma$ along $T$; since the tangent planes to
$\sigma$ at points on $T$ are also tangent planes to $\sigma_T$, it follows that the
circumscribed developable touches $\sigma$ along the line $T$.

(b) Parallelism with respect to a Surface

10. Geometrical definition. 

In Euclidean plane geometry, when two points $P$, $P_1$, are
fixed, then to every direction drawn from $P$ there corresponds
one and only one direction drawn from $P_1$ and parallel to the
first. We now propose to extend this idea from the intrinsic
geometry of the plane to that of any surface $\sigma$ whatever.

For this purpose consider a point $P$ of $\sigma$, the corresponding
tangent plane $\varpi$, and a generic direction drawn from $P$
tangentially to $\sigma$ and therefore lying in $\varpi$. We shall consider the direction
as determined by the corresponding versor (unit vector) $\mathbf{u}$, and
shall accordingly refer merely to the direction $\mathbf{u}$ instead of the
direction whose versor is $\mathbf{u}$. Let $P_1$ be any other point on $\sigma$,
and $\varpi_1$ the tangent plane at $P_1$.

If the surface $\sigma$ is developable, we can obviously establish a
correspondence, which we shall call parallelism, between the
directions drawn tangentially from $P$ and those from $P_1$. The
direction which becomes parallel to $\mathbf{u}$ in the ordinary sense
when $\sigma$ is developed upon a plane will be called parallel to $\mathbf{u}$ with
respect to the surface.

This criterion fails in the case of a non-developable surface
$\sigma$ (even of the most elementary type, such as a sphere), and it
is natural to look for an adequate generalization of it. The most
direct solution is obtained by adding to the elements of position
already considered (which are sufficient without further definition
for developable surfaces) a connecting law, a priori arbitrary,
according to which $P_1$ is to be considered as reached from $P$ by
moving along a specified curve $T$ lying on $\sigma$ (the curve of
displacement).

We can now, with reference to this curve $T$, define parallel
displacement from $P$ to $P_1$ as follows. Consider the developable
circumscribed to $\sigma$ along $T$; this surface, which we shall call
$\sigma_T$, is, as we know, tangential to $\sigma$ along the given curve, and in
particular at $P$ and $P_1$. Hence the directions tangential to $\sigma$
at these two points are also tangential to $\sigma_T$. We can now take
for our definition of surface parallelism on $\sigma$ along $T$ the
parallelism which we have associated with the developable $\sigma_T$, and
we shall agree to say that the parallel at $P_1$ along the line $T$
to a generic direction (in the surface) $\mathbf{u}$ at $P$ is the direction
(in the surface) $\mathbf{u}_1$ which, on the developable $\sigma_T$, is parallel to
$\mathbf{u}$ in the sense just defined.¹

¹ A simple and so to speak automatic way of constructing parallel directions is
to roll the surface $\sigma$ along a plane. Cf. Persico: "Realizzazione cinematica del
parallelismo superficiale", in Rend. della R. Acc. dei Lincei, Vol. XXX (2nd
half-year, 1921), pp. 127-128.
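
The geometrical definition can be followed through explicitly in one case not treated in the text, assumed here for illustration: the unit sphere, with $T$ a parallel of colatitude $\theta$ ($0 < \theta < \pi/2$). The circumscribed developable is then a cone of revolution, which develops into a plane sector; the parallel becomes a circular arc of radius $\tan\theta$ and length $2\pi\sin\theta$. A direction carried by parallelism keeps a fixed direction in the plane of development while the tangent to the arc turns through the angle of the sector, so that after one complete circuit the carried direction makes that angle with the direction from which it started. The function names are illustrative, and the sense in which the angle is counted is left unspecified.

```python
import math

def sector_angle(colat):
    """Angle of the plane sector obtained by developing the cone circumscribed to the
    unit sphere along the parallel of colatitude colat (0 < colat < pi/2)."""
    slant = math.tan(colat)                 # distance from the vertex of the cone to the parallel
    arc = 2.0 * math.pi * math.sin(colat)   # length of the parallel
    return arc / slant                      # equals 2*pi*cos(colat)

def turn_after_circuit(colat):
    """Angle between a direction and its parallel after one circuit, reduced to (-pi, pi]."""
    beta = sector_angle(colat)
    return math.atan2(math.sin(beta), math.cos(beta))
```

For a parallel with $\cos\theta = \tfrac14$, for instance, the sector is a quadrant and the direction returns turned through a right angle.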
11. First consequences. Equipollence of vectors with respect
to a surface.

A necessary consequence of the foregoing definition is that,
contrary to what happens for developables, the direction $\mathbf{u}_1$
which is parallel with respect to the surface to the direction
$\mathbf{u}$ at $P$, is not uniquely determined by $P$, $\mathbf{u}$, $P_1$, alone, but in
general depends also on the curve of displacement. From this
point of view the geometrical concept of parallelism can be
compared with the physical concept of work, which involves
the integral of an expression of the form $X_1\, dx_1 + X_2\, dx_2$ (where
$x_1$, $x_2$ are co-ordinates, of any kind, of the points of $\sigma$). This
integral in general depends on the line $T$ of integration; only
in the particular case when $X_1\, dx_1 + X_2\, dx_2$ is a perfect
differential is there no such dependence.

Returning to parallelism along $T$, we must first point out that
angles are unchanged by parallel displacement. That is to say,
if $\mathbf{a}$, $\mathbf{b}$ are two generic directions (in the surface) at $P$, their parallels
at $P_1$ with respect to the surface, $\mathbf{a}_1$, $\mathbf{b}_1$, contain the same angle.
This is obvious if we notice that we have parallelism in the
ordinary sense in the plane upon which $\sigma_T$ is developed, and that
further the operation of development does not change angles.

Up to this point of the discussion we have referred solely to
directions, with their corresponding versors. It is clear that the
same construction as that used to pass from $\mathbf{u}$ to $\mathbf{u}_1$ can be applied
to a tangential vector $\mathbf{R}$ of any (non-unit) length $R$. If $\mathbf{u}$ is the
corresponding versor, we have $\mathbf{R} = R\,\mathbf{u}$, from which we get a
vector $\mathbf{R}_1 = R\,\mathbf{u}_1$, i.e. a vector localized at $P_1$ having the same
length as $\mathbf{R}$ and the same direction as the versor $\mathbf{u}_1$ which is
parallel to $\mathbf{u}$ with respect to the surface. We shall naturally
say that the vectors $\mathbf{R}$ and $\mathbf{R}_1$ are equipollent with respect to
the surface, with reference to the path $T$. In substance this
concept of equipollence with respect to a surface reduces at once
to parallelism, two tangential vectors being equipollent when
they are parallel and have the same length.

The case where the curve of displacement $T$ is a geodesic¹
on $\sigma$ calls for a special note in relation to parallelism. $T$ is then
also a geodesic on $\sigma_T$. In order to see that this is so, note that
$\sigma$ and $\sigma_T$ have the same tangent planes at all points of $T$; hence,
if the various osculating planes of $T$ are normal to one of the
surfaces, they are also normal to the other. When $\sigma_T$ is developed
on a plane, the geodesic $T$ becomes a straight line (an immediate
consequence of its characteristic property of giving the shortest
path between any two points on it), and the directions $\mathbf{u}$ and $\mathbf{u}_1$,
which become parallel in the plane as a result of the development,
will make equal angles with this line. Since development does
not change angles, we deduce that parallel directions on $\sigma$ at
points of a geodesic make equal angles with this geodesic.² In
particular, if $\mathbf{u}$ coincides with the direction of $T$ at $P$, then
$\mathbf{u}_1$ will coincide with the direction of $T$ at $P_1$, or in other words,
the directions of a geodesic at its various points are all parallel
(along the geodesic itself); more shortly, the geodesics are
autoparallel curves. It follows from these arguments that
autoparallelism is a characteristic property of geodesics and can be
used to define them.³

¹ I.e., with the usual definition, any line on $\sigma$ such that at every point its
osculating plane is perpendicular to the tangent plane to $\sigma$. The lines which give
the shortest path lying on the surface between two given points always have this
property. Further, the reciprocal theorem is also true (under certain restrictions);
hence to define geodesics we can use sometimes one and sometimes the other
criterion. We shall return to the question farther on (cf. p. 130).

12. Infinitesimal displacement. Infinitesimal form of the law 
of parallelism. 

Suppose in particular that $P_1$ is infinitely near to $P$, so that
the path $T$ is reduced to the elementary arc $PP_1$, which is uniquely
determined (except for infinitesimals of order higher than the
first) by its extremities. For the development in this case we
need only give the plane $\varpi_1$ an elementary rotation round the
straight line $r$ in which it intersects $\varpi$. Incidentally we may
note that the direction of this line is said to be conjugate to the
direction $PP_1$, at $P$ or at $P_1$ (both points giving the same result,
except for infinitesimals). We shall denote by $-\boldsymbol{\omega}$ the
infinitesimal vector, parallel to $r$, which in magnitude, direction, and
sense represents the elementary rotation by means of which $\varpi_1$
is brought into coincidence with $\varpi$. Then $\boldsymbol{\omega}$ will be the elementary
rotation which will bring $\varpi_1$ back from the plane of development
$\varpi$ to its original position as tangent plane at $P_1$. Let $\mathbf{R}$ be a generic
tangential vector drawn from $P$; in order to find the equipollent
vector $\mathbf{R}_1$ at $P_1$, we draw from $P_1$, when in the plane of
development, the vector equipollent in the ordinary sense to $\mathbf{R}$, and then
bring the plane $\varpi_1$ back to its original position, carrying with it
the vector so constructed. Thus the vector $\mathbf{R}_1$ is merely $\mathbf{R}$,
after having undergone a displacement (of no interest if we
consider the vector independently of its point of application)
and also the rotation $\boldsymbol{\omega}$. From the elementary principles of
rigid dynamics we find that the difference between the vectors
$\mathbf{R}_1$ and $\mathbf{R}$, i.e. the vectorial increment $d\mathbf{R}$ of the vector $\mathbf{R}$ during
the parallel displacement from $P$ to $P_1$, is given by

$$d\mathbf{R} = \boldsymbol{\omega} \wedge \mathbf{R} ,$$

i.e. the vector product of $\boldsymbol{\omega}$ and $\mathbf{R}$.

² Taking this property as defining parallelism with respect to a surface we can
deduce from it for the sphere an elegant geometrico-kinematical construction from
which various other properties follow easily. Cf. G. Corbellini: "Genesi
cinematica intrinseca del parallelismo di Levi-Civita", in Rend. della R. Acc. dei
Lincei, Vol. XXXII (1st half-year, 1923), pp. 72-76.

³ This statement will be recognized as an obvious extension to surfaces of any
kind whatever of the primary intuition of the nature of the straight line, expressed
by Euclid in the words εὐθεῖα γραμμή ἐστιν, ἥτις ἐξ ἴσου τοῖς ἐφ' ἑαυτῆς σημείοις
κεῖται (a straight line is that which lies equally with respect to all its points).

As both CO and R are vectors in the plane w it follows that the 
increment (iR is perpendicular to this plane, or, in particular, 
is zero.^ 

We shall now show that this condition, combined with the
condition that $\mathbf{R}_1$ is a tangential vector (i.e. belongs to $\varpi_1$),
completely determines the vector $\mathbf{R}_1$, so that we may take as
the differential definition of parallelism with respect to a surface
the following geometrical relations, in which $\mathbf{n}$ denotes the normal
to $\varpi$:

$$d\mathbf{R} \parallel \mathbf{n}, \tag{18}$$

$$\mathbf{R}_1 \parallel \varpi_1.$$

To prove this, note that the equation

$$\mathbf{R} = \mathbf{R}_1 - d\mathbf{R}$$

must be satisfied; i.e. that it must be possible to resolve the
vector $\mathbf{R}$ into one component $\mathbf{R}_1$ parallel to a given plane and

¹ The last-mentioned case will occur if $\mathbf{R}$ has the direction of $r$, i.e. the
conjugate of $PP_1$; in this, and only in this case, the parallel $\mathbf{R}_1$ with respect to the
surface coincides with the Euclidean parallel. This remark is due to Professor
Bompiani, who has made use of it to generalize the theory of systems conjugate to
surfaces belonging to non-Euclidean spaces; cf. Atti del R. Ist. Veneto, Vol. LXXX,
1921, p. U2Q.

another, $-d\mathbf{R}$, parallel to a given direction not contained in
the plane; it is known that this can be done in only one way.¹

13. The intrinsic character of parallelism. 

Returning to the question of parallel displacement along an
arc $T$ of finite length, we see at once that if $T$ is a segment of
a geodesic, parallelism depends solely on the intrinsic properties
of the surface $\sigma$; i.e. it depends on the nature of the linear element
$ds$, and not on the configuration of the surface in space, as might
a priori have been supposed from the geometrical construction
(which uses the surrounding space) or the equivalent formulae
(18) and $\mathbf{R}_1 \parallel \varpi_1$.

In fact, we need only recall the two general properties of the
conservation of angles and the autoparallelism of geodesics.
The parallel at $P_1$ to a generic direction $u$ drawn from $P$
is determined by the conditions (a) of belonging to the surface
$\sigma$, and (b) of making the same angle at $P_1$ with the geodesic of
displacement as $u$ does at $P$. It will be seen that we are dealing
with angular properties which depend solely on the metric of $\sigma$.

This argument for a geodesic $T$ can easily be extended to
the general case, if we suppose $T$ divided up into elementary
displacements, from a generic point $P$ to a very near point $P_1$.
In a displacement of this kind the elementary change in the
direction $u$ is determined, as we have seen, by the extremities
$P$, $P_1$; the nature of the line joining these extremities has no
effect, and we may therefore think of it as a displacement along
an infinitesimal segment of a geodesic. But a displacement of
this kind depends only on the intrinsic properties of the surface;
hence we see that in general this is true also for the change in
$u$, and therefore for parallelism, whatever may be the line of
displacement.

The same result holds good for equipollence, i.e. for the
displacement of vectors of any (non-unit) length whatever.
In fact (§ 11), this length by definition remains unchanged.

¹ Some interesting geometrical consequences, especially for the case of ruled
surfaces, have been pointed out by A. Myller in some notes in Comptes
Rendus; cf. Vol. 174 (1922), pp. 997-998; Vol. 175 (1922), pp. 939-941; Vol. 176
(1923), pp. 483-485. Cf. also a recent note by O. Mayer: “Une interprétation
géométrique de la seconde forme quadratique fondamentale d’une surface en
relation avec la théorie du parallélisme”, ibid., Vol. 178 (1924), pp. 954-956.

14. The symbolic equation of parallelism.

The condition (18) can be put in a more expressive form if
we note that it is equivalent to saying that the vector $d\mathbf{R}$ is
perpendicular to every direction which is tangential to $\sigma$ at $P$,
or in other words, if we think of such a direction as being determined
by an infinitesimal displacement of the point $P$ along
the surface, that $d\mathbf{R}$ is perpendicular to all these displacements.
In symbols, if $\delta P$ denotes the infinitesimal vector representing
the displacement, we shall have

$$d\mathbf{R} \times \delta P = 0 \tag{19}$$

for any $\delta P$ whatever which is tangential to $\sigma$, an equation similar
in form to the equation of virtual work. If $dY_\nu$ ($\nu = 1, 2, 3$)
denotes the components of $d\mathbf{R}$, and $\delta y_\nu$ ($\nu = 1, 2, 3$) the components
of $\delta P$ (in both cases referred to the orthogonal Cartesian
co-ordinates $y_\nu$), we have identically

$$d\mathbf{R} \times \delta P = \sum_{\nu=1}^{3} dY_\nu\,\delta y_\nu, \tag{20}$$

and the vectorial relation (19) is thus transformed into the scalar
relation

$$\sum_{\nu=1}^{3} dY_\nu\,\delta y_\nu = 0; \tag{19'}$$

this, or the original equation (19), may be called the symbolic
equation of parallelism.

15. Intrinsic equations of parallelism. 

As the symbolic equation involves geometrical elements which 
do not belong to the surface, it does not show directly that paral- 
lelism is a concept depending only on the intrinsic properties 
of the surface. But we can deduce from it without much diffi- 
culty other equations which have this important characteristic. 

In order to do so, we shall naturally try to find the values in
terms of intrinsic elements of the quantities $dY_\nu$ and $\delta y_\nu$, which
occur in equation (19'). Take first the displacements $\delta y_\nu$. The
only condition imposed on them, other than that of being
infinitesimal, is that they represent a displacement along the
surface $\sigma$; they can therefore be expressed in terms of the
corresponding (arbitrary) variations $\delta x_k$ of the surface co-ordinates,
by differentiating the equations (1). We accordingly
have

$$\delta y_\nu = \sum_{k=1}^{2} \frac{\partial y_\nu}{\partial x_k}\,\delta x_k.$$

As the vector $\mathbf{R}$ is tangential, we can define it intrinsically
by means of its contravariant components $R^i$, and substitute for
the $Y_\nu$'s the expressions (12).

Putting

$$\tau_k = \sum_{\nu=1}^{3} \frac{\partial y_\nu}{\partial x_k}\; d\!\left(\sum_{i=1}^{2} R^i\,\frac{\partial y_\nu}{\partial x_i}\right) \tag{21}$$

for the sake of shortness, the identity (20) can therefore finally
be written in the form

$$d\mathbf{R} \times \delta P = \sum_{k=1}^{2} \tau_k\,\delta x_k; \tag{20'}$$

since the $\delta x_k$ are completely arbitrary, it follows that the symbolic
equation of parallelism is equivalent to the two following equations:

$$\tau_k = 0 \qquad (k = 1, 2). \tag{22}$$

These are the two equations which define the increments
$dR^1$, $dR^2$ to be assigned to the components of a generic vector
$\mathbf{R}$ when it undergoes a parallel displacement along the elementary
path $dx_1$, $dx_2$; that they are really intrinsic equations will be
clear when the expressions are written out in full, as will now
be done.

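Differentiating the product on the right-hand side of the
equations (21), and using the expression for the coefficients $a_{ik}$
given in formula (3), the expression for $\tau_k$ becomes

$$\tau_k = \sum_{i=1}^{2} a_{ik}\,dR^i + \sum_{\nu=1}^{3} \frac{\partial y_\nu}{\partial x_k} \sum_{j,l=1}^{2} R^j\,\frac{\partial^2 y_\nu}{\partial x_j\,\partial x_l}\,dx_l,$$

or

$$\tau_k = \sum_{i=1}^{2} a_{ik}\,dR^i + \sum_{j,l=1}^{2} R^j\,dx_l \sum_{\nu=1}^{3} \frac{\partial y_\nu}{\partial x_k}\,\frac{\partial^2 y_\nu}{\partial x_j\,\partial x_l}. \tag{21'}$$

We have now to show that the result of the summation with
respect to $\nu$ can be expressed in terms of intrinsic elements;
more precisely, that it is a linear combination of the derivatives
of the coefficients $a_{ik}$. Consider its general term, and note that
we can write

$$\frac{\partial y_\nu}{\partial x_k}\,\frac{\partial^2 y_\nu}{\partial x_j\,\partial x_l} = \frac{\partial}{\partial x_j}\!\left(\frac{\partial y_\nu}{\partial x_k}\,\frac{\partial y_\nu}{\partial x_l}\right) - \frac{\partial^2 y_\nu}{\partial x_k\,\partial x_j}\,\frac{\partial y_\nu}{\partial x_l},$$

or analogously

$$\frac{\partial y_\nu}{\partial x_k}\,\frac{\partial^2 y_\nu}{\partial x_j\,\partial x_l} = \frac{\partial}{\partial x_l}\!\left(\frac{\partial y_\nu}{\partial x_k}\,\frac{\partial y_\nu}{\partial x_j}\right) - \frac{\partial^2 y_\nu}{\partial x_k\,\partial x_l}\,\frac{\partial y_\nu}{\partial x_j}.$$

In order to maintain symmetry in the indices $j$ and $l$, we shall
take half the sum of the expressions on the right of these equations
to represent the value of the term in question. Noting that the
sum of the two terms preceded by the minus sign is exactly the
derivative with respect to $x_k$ of the product
$\frac{\partial y_\nu}{\partial x_j}\frac{\partial y_\nu}{\partial x_l}$, we get

$$\frac{\partial y_\nu}{\partial x_k}\,\frac{\partial^2 y_\nu}{\partial x_j\,\partial x_l} = \frac{1}{2}\left\{\frac{\partial}{\partial x_j}\!\left(\frac{\partial y_\nu}{\partial x_k}\,\frac{\partial y_\nu}{\partial x_l}\right) + \frac{\partial}{\partial x_l}\!\left(\frac{\partial y_\nu}{\partial x_k}\,\frac{\partial y_\nu}{\partial x_j}\right) - \frac{\partial}{\partial x_k}\!\left(\frac{\partial y_\nu}{\partial x_j}\,\frac{\partial y_\nu}{\partial x_l}\right)\right\}.$$

Now sum with respect to $\nu$. Remembering the values of the
coefficients $a_{ik}$, we get

$$\sum_{\nu=1}^{3} \frac{\partial y_\nu}{\partial x_k}\,\frac{\partial^2 y_\nu}{\partial x_j\,\partial x_l} = \frac{1}{2}\left(\frac{\partial a_{kl}}{\partial x_j} + \frac{\partial a_{jk}}{\partial x_l} - \frac{\partial a_{jl}}{\partial x_k}\right).$$

Hence this sum has been put in the required form. The
right-hand side of this equation is represented shortly by the
symbol

$$[jl, k]$$

(Christoffel's symbol of the first kind); which is easy to remember,
the arrangement of the indices corresponding to that of the
negative term of the linear combination above, while the two
positive terms have the same indices but differently arranged.
We shall investigate presently some properties of these symbols;
for the moment we need only remark that they represent certain
functions of the surface co-ordinates $x_1$, $x_2$ which depend only
on the fundamental quadratic form.

Returning to the expression (21') for the quantities $\tau_k$, we
can now write it in the form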

$$\tau_k = \sum_{i=1}^{2} a_{ik}\,dR^i + \sum_{j,l=1}^{2} [jl, k]\,R^j\,dx_l \qquad (k = 1, 2). \tag{21''}$$

Before continuing the argument, it is important to note that
the quantities $\tau_k$, which (as shown by equation (21'')) depend
on two vectors ($\mathbf{R}$ and the displacement $dx_1$, $dx_2$) as well as on
the coefficients of $ds^2$ and their first derivatives, are covariant.
This follows from the invariance of the linear form
$\sum_{k=1}^{2} \tau_k\,\delta x_k$, which is itself shown by the identity (20').

The system which is the reciprocal of the $\tau_k$'s, namely,

$$\tau^i = \sum_{k=1}^{2} a^{ik}\,\tau_k \qquad (i = 1, 2),$$

is accordingly contravariant; using equation (21''), it can be put
in the form

$$\tau^i = dR^i + \sum_{j,l=1}^{2} R^j\,dx_l \sum_{k=1}^{2} a^{ik}\,[jl, k],$$

or, putting

$$\sum_{k=1}^{2} a^{ik}\,[jl, k] = \{jl, i\}$$

(Christoffel's symbol of the second kind), in the form

$$\tau^i = dR^i + \sum_{j,l=1}^{2} \{jl, i\}\,R^j\,dx_l. \tag{21'''}$$

The equations of parallelism (22), as is a priori to be expected
from their geometrical significance, are invariant whatever system
of curvilinear co-ordinates $x_1$, $x_2$ is chosen. This is evident from
the fact that they express the vanishing of the covariant system
$\tau_k$ (cf. remarks on pp. 71, 84). The equations of parallelism can
of course also be put in the equivalent form

$$\tau^i = 0 \qquad (i = 1, 2), \tag{22'}$$

which also shows that they are invariant.

Solving them for the differentials $dR^i$, we get

$$dR^i = -\sum_{j,l=1}^{2} \{jl, i\}\,R^j\,dx_l \qquad (i = 1, 2). \tag{23}$$

This is the final form of the differential equations of parallelism.
It gives the increments of the contravariant components
of a surface vector in an equipollent displacement along the
elementary path $dx_1$, $dx_2$, expressed in terms of the $dx_l$, the
components of the vector, and certain functions of position (to be
considered as given) depending only on the coefficients of $ds^2$
and therefore on the intrinsic nature of the surface.
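
The equations (23) can be tried out numerically. The following Python sketch is only an illustration under stated assumptions: the surface is taken to be the unit sphere with $x_1$ the colatitude and $x_2$ the longitude, and the function names, the finite-difference step and the Runge-Kutta integrator are choices made for the example, not part of the text. It builds the symbols of the first and second kind from the coefficients of $ds^2$ alone and carries a vector once round a parallel.

```python
import math

def metric(x):
    """Coefficients a_ik of ds^2 = dx1^2 + sin^2(x1) dx2^2 (assumed example surface)."""
    return [[1.0, 0.0], [0.0, math.sin(x[0]) ** 2]]

def metric_derivatives(x, h=1e-5):
    """da[l][i][k] approximates the derivative of a_ik with respect to x_l (central differences)."""
    da = []
    for l in range(2):
        xp, xm = list(x), list(x)
        xp[l] += h
        xm[l] -= h
        ap, am = metric(xp), metric(xm)
        da.append([[(ap[i][k] - am[i][k]) / (2 * h) for k in range(2)] for i in range(2)])
    return da

def christoffel_second(x):
    """Return g with g[i][j][l] = {jl, i}, the symbol of the second kind."""
    a = metric(x)
    da = metric_derivatives(x)
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    inv = [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]
    # first[k][j][l] = [jl, k] = (d a_jk/dx_l + d a_lk/dx_j - d a_jl/dx_k) / 2
    first = [[[0.5 * (da[l][j][k] + da[j][l][k] - da[k][j][l])
               for l in range(2)] for j in range(2)] for k in range(2)]
    return [[[sum(inv[i][k] * first[k][j][l] for k in range(2))
              for l in range(2)] for j in range(2)] for i in range(2)]

def increment_rate(x, xdot, R):
    """Right-hand side of (23) divided by the increment of the parameter."""
    g = christoffel_second(x)
    return [-sum(g[i][j][l] * R[j] * xdot[l] for j in range(2) for l in range(2))
            for i in range(2)]

def transport_round_parallel(theta0, R0, steps=2000):
    """Carry R once round the circle x1 = theta0 with classical Runge-Kutta steps."""
    h = 2 * math.pi / steps
    xdot = [0.0, 1.0]  # only the longitude varies along a parallel
    R, phi = list(R0), 0.0
    for _ in range(steps):
        k1 = increment_rate([theta0, phi], xdot, R)
        k2 = increment_rate([theta0, phi + h / 2], xdot, [R[i] + h / 2 * k1[i] for i in range(2)])
        k3 = increment_rate([theta0, phi + h / 2], xdot, [R[i] + h / 2 * k2[i] for i in range(2)])
        k4 = increment_rate([theta0, phi + h], xdot, [R[i] + h * k3[i] for i in range(2)])
        R = [R[i] + h / 6 * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) for i in range(2)]
        phi += h
    return R

def length_squared(x, R):
    """Squared length of R, i.e. the sum of a_ik R^i R^k."""
    a = metric(x)
    return sum(a[i][k] * R[i] * R[k] for i in range(2) for k in range(2))

theta0 = math.pi / 3
R_end = transport_round_parallel(theta0, [1.0, 0.0])
print(R_end, length_squared([theta0, 0.0], R_end))
```

On the parallel of colatitude $\pi/3$ the vector turns through $2\pi\cos(\pi/3) = \pi$ relative to the meridians, so that after a complete circuit it returns with its length unchanged and its direction reversed, up to the error of the numerical integration.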

16. Christoffel's symbols.

We have introduced the symbols

$$[jl, k] = \frac{1}{2}\left(\frac{\partial a_{jk}}{\partial x_l} + \frac{\partial a_{lk}}{\partial x_j} - \frac{\partial a_{jl}}{\partial x_k}\right), \tag{24}$$

$$\{jl, i\} = \sum_{k=1}^{n} a^{ik}\,[jl, k], \tag{25}$$

which can also be formally extended to quadratic forms in $n$
variables; we now propose to examine their more elementary
properties.

First, it is obvious that both symbols are symmetrical with
respect to the coupled indices, i.e. that

$$[jl, k] = [lj, k], \qquad \{jl, i\} = \{lj, i\}.$$

Consequently for a form in $n$ variables there are $n$ of each
kind corresponding to each pair of indices. Hence there are
in all $\tfrac{1}{2} n^2 (n + 1)$ of each kind (the number of first derivatives
of the coefficients $a_{ik}$).

It is easy to express the derivatives of the $a_{ik}$ in terms of
Christoffel's symbols. Writing down equation (24) and the
corresponding equation obtained by interchanging $l$ and $k$, and
adding them, we get the following formula, which frequently
occurs:

$$\frac{\partial a_{kl}}{\partial x_j} = [jk, l] + [jl, k]. \tag{24'}$$

From equation (25), applying Cramer's rule in the usual
way, we can get the symbols of the first kind in terms of those
of the second kind. Multiplying by $a_{im}$ and summing with respect
to $i$, we get in fact

$$[jl, m] = \sum_{i=1}^{n} a_{im}\,\{jl, i\}. \tag{25'}$$

Lastly, we shall prove a formula which is frequently used
and gives the derivatives of the determinant $a$ (or more precisely
of its logarithm) in terms of Christoffel's symbols.

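Applying the usual rule for differentiating a determinant of
order $n$, we see that the derivative of $a$ with respect to any one
of the $x$'s (say $x_i$) is the sum of $n$ determinants, any one of which
(say the $k$th) is obtained from $a$ by replacing the elements of
the $k$th row by their derivatives. A determinant of this kind,
expanded from the $k$th row, can be written in the form

$$\sum_{j=1}^{n} \frac{\partial a_{kj}}{\partial x_i}\,a^{kj}\,a$$

(the co-factor of $a_{kj}$ being $a^{kj}$ multiplied by $a$); hence

$$\frac{\partial a}{\partial x_i} = a \sum_{j,k=1}^{n} a^{kj}\,\frac{\partial a_{kj}}{\partial x_i},$$

or dividing by $a$, and using formula (24'),

$$\frac{\partial \log a}{\partial x_i} = \sum_{j,k=1}^{n} a^{kj}\,\bigl([ij, k] + [ik, j]\bigr).$$

Finally, by formula (25), we get

$$\frac{\partial \log a}{\partial x_i} = \sum_{j=1}^{n} \{ij, j\} + \sum_{k=1}^{n} \{ik, k\}.$$

The two sums in this formula differ only in the letter chosen
to denote the index of summation; hence we have

$$\frac{\partial \log a}{\partial x_i} = 2 \sum_{k=1}^{n} \{ki, k\}.$$

This formula is more frequently written in the form obtained
by dividing by 2, i.e.

$$\frac{\partial \log \sqrt{a}}{\partial x_i} = \sum_{k=1}^{n} \{ki, k\} \qquad (i = 1, 2, \dots, n). \tag{26}$$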
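
As a brief check of (26), offered as an illustration rather than as part of the original argument, one may take the binary form of the unit sphere, $ds^2 = dx_1^2 + \sin^2 x_1\,dx_2^2$ (an assumed example), for which the only symbols of the second kind that do not vanish follow at once from (24) and (25):

```latex
% Assumed example: unit sphere, x_1 = colatitude, x_2 = longitude.
a_{11} = 1, \quad a_{12} = 0, \quad a_{22} = \sin^2 x_1, \qquad
a = \sin^2 x_1, \quad \sqrt{a} = \sin x_1 .

% Non-vanishing symbols of the second kind:
\{22, 1\} = -\sin x_1 \cos x_1, \qquad \{12, 2\} = \{21, 2\} = \cot x_1 .

% Formula (26) for i = 1 and for i = 2:
\frac{\partial \log \sqrt{a}}{\partial x_1} = \cot x_1 = \{11, 1\} + \{21, 2\}, \qquad
\frac{\partial \log \sqrt{a}}{\partial x_2} = 0 = \{12, 1\} + \{22, 2\} .
```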


17. Equations of parallelism in terms of covariant components.

It is easy to find equations analogous to (23) for the differentials
of the covariant components of the vector $\mathbf{R}$. These components
in fact are obtained from the contravariant components by means
of the relations (cf. §§ 3, 6, pp. 90, 96)

$$R_i = \sum_{j=1}^{2} a_{ij}\,R^j;$$

hence, differentiating and changing $j$ into $k$ in the second sum,

$$dR_i = \sum_{j,l=1}^{2} \frac{\partial a_{ij}}{\partial x_l}\,dx_l\,R^j + \sum_{k=1}^{2} a_{ik}\,dR^k.$$

Now substitute for $dR^k$ the expression given for it by formula
(23), and we shall have

$$dR_i = \sum_{j,l=1}^{2} \frac{\partial a_{ij}}{\partial x_l}\,dx_l\,R^j - \sum_{j,l,k=1}^{2} a_{ik}\,\{jl, k\}\,R^j\,dx_l.$$

In the first sum, we can express the derivatives of the coefficients
$a_{ij}$ in terms of the symbols of the first kind, so getting

$$\sum_{j,l=1}^{2} \bigl\{[jl, i] + [il, j]\bigr\}\,R^j\,dx_l;$$

in the second, we can sum with respect to $k$ (cf. formula (25')),
so getting

$$-\sum_{j,l=1}^{2} [jl, i]\,R^j\,dx_l.$$

We thus have

$$dR_i = \sum_{j,l=1}^{2} [il, j]\,dx_l\,R^j.$$

In order to make the contravariant components disappear
altogether, we substitute for $R^j$ from the formula

$$R^j = \sum_{k=1}^{2} a^{jk}\,R_k;$$

summing with respect to $j$ (which, by formula (25), changes the
symbol of the first kind to one of the second kind), we get

$$dR_i = \sum_{k,l=1}^{2} \{il, k\}\,R_k\,dx_l.$$

Finally, changing $k$ into $j$ in order to show more clearly the
analogy with the equations (23), we have the equations

$$dR_i = \sum_{j,l=1}^{2} \{il, j\}\,R_j\,dx_l \qquad (i = 1, 2), \tag{27}$$

which are equivalent to (23). They are in fact the result of combining
certain formulae and identities with the equations (23);
and reciprocally, starting from (27), an analogous process will
give (23), as can easily be verified.

18. Some analytical verifications. 

We are now in a position to give an analytical proof of some 
properties of parallelism which have already been obtained as 
immediate consequences of the geometrical definition. 

Consider first the parallel displacement of a vector $\mathbf{R}$ along a
finite segment $T$ of a curve, from $P$ to $P_1$. Let the curve be
defined by the parametric equations

$$x_i = x_i(s) \qquad (i = 1, 2), \tag{28}$$

where $s$ represents any parameter (which may, if we wish, be
the length of the arc measured from an arbitrary origin $P_0$).
The quantities $R^i$ are to be considered as functions of $s$ with
arbitrarily assigned values at $P$. The equations (23), divided
by $ds$, become

$$\dot{R}^i = -\sum_{j,l=1}^{2} \{jl, i\}\,R^j\,\dot{x}_l,$$

where the dot indicates differentiation with respect to $s$, and
the quantities $\dot{x}_l$ are of course obtained by differentiating equations
(28), and are therefore to be considered as given functions.
These are two linear differential equations of the first order,
in the normal form with respect to the derivatives of the two
unknown functions $R^1$, $R^2$; hence, as is known from the calculus,
they uniquely determine these two functions when the (arbitrary)
initial values are given. We have thus a confirmation of the
geometrically obvious fact of the possibility of displacing an
arbitrarily assigned surface vector, and of the uniqueness of the
result.

Using the differential equations already found, we shall now
prove that the length of a vector and the angle between two vectors
are unchanged by a parallel displacement. These two results
can be proved simultaneously, as follows. Let $\mathbf{R}$, $\mathbf{V}$ be two
vectors. Give them a parallel displacement along an infinitesimal
path, and calculate the change in their scalar product due to
this displacement. We shall have (cf. formula (14))

$$d(\mathbf{R} \times \mathbf{V}) = \sum_{i=1}^{2} R^i\,dV_i + \sum_{i=1}^{2} V_i\,dR^i;$$

substituting for $dR^i$ and $dV_i$ from (23) and (27), this becomes

$$d(\mathbf{R} \times \mathbf{V}) = \sum_{i,j,l=1}^{2} R^i\,\{il, j\}\,V_j\,dx_l - \sum_{i,j,l=1}^{2} V_i\,\{jl, i\}\,R^j\,dx_l.$$

Interchanging $i$ and $j$ in one of the two sums, we see that the
sums are equal, and therefore

$$d(\mathbf{R} \times \mathbf{V}) = 0;$$

i.e. the scalar product is unchanged by an infinitesimal (and therefore
also by a finite) displacement. Now let $\mathbf{V}$ coincide with
$\mathbf{R}$, so that $\mathbf{R} \times \mathbf{V} = R^2$, and we at once obtain the result
that the length of a vector is unchanged by a parallel displacement.
Combining this result with equation (13), we see that
as the scalar product of two vectors and their respective lengths
are all unchanged, the angle between them (provided neither
vector is of zero length) must also remain the same.
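
The invariance of the scalar product can also be observed numerically. The Python sketch below is an illustration under assumptions of its own: the surface is the unit sphere (with its well-known symbols of the second kind written out explicitly), the path `curve` is an arbitrary hypothetical choice, and a classical Runge-Kutta scheme stands in for the exact integration. One vector is carried by its contravariant components according to (23), the other by its covariant components according to (27).

```python
import math

def symbols_second_kind(theta):
    """g[i][j][l] = {jl, i} for the unit sphere (index 0: colatitude, 1: longitude)."""
    g = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    g[0][1][1] = -math.sin(theta) * math.cos(theta)
    g[1][0][1] = g[1][1][0] = math.cos(theta) / math.sin(theta)
    return g

def curve(s):
    """A hypothetical test path x_i(s), returned together with its derivatives."""
    return [1.0 + 0.3 * math.sin(s), s], [0.3 * math.cos(s), 1.0]

def rates(s, state):
    """state = [R^1, R^2, V_1, V_2]; R obeys (23) and V obeys (27), both divided by ds."""
    x, xdot = curve(s)
    g = symbols_second_kind(x[0])
    R, V = state[:2], state[2:]
    dR = [-sum(g[i][j][l] * R[j] * xdot[l] for j in range(2) for l in range(2))
          for i in range(2)]
    dV = [sum(g[j][i][l] * V[j] * xdot[l] for j in range(2) for l in range(2))
          for i in range(2)]
    return dR + dV

def integrate(state, s0, s1, steps=1000):
    """Classical fourth-order Runge-Kutta integration from s0 to s1."""
    h = (s1 - s0) / steps
    s = s0
    for _ in range(steps):
        k1 = rates(s, state)
        k2 = rates(s + h / 2, [y + h / 2 * k for y, k in zip(state, k1)])
        k3 = rates(s + h / 2, [y + h / 2 * k for y, k in zip(state, k2)])
        k4 = rates(s + h, [y + h * k for y, k in zip(state, k3)])
        state = [y + h / 6 * (a + 2 * b + 2 * c + d)
                 for y, a, b, c, d in zip(state, k1, k2, k3, k4)]
        s += h
    return state

def scalar_product(state):
    """Formula (14): the sum of the products R^i V_i."""
    return state[0] * state[2] + state[1] * state[3]

start = [0.7, -0.2, 0.4, 1.1]
end = integrate(start, 0.0, 2.0)
print(scalar_product(start), scalar_product(end))
```

The individual components change along the path, while the two printed scalar products agree to within the error of the integration.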

19. Permutability. 

While a tangential vector is intrinsically defined by two
numbers, the geometrical notion corresponding to it, as we have
already said, is a segment of a tangent line at a point $P$ of the
surface $\sigma$, an entity which does not belong wholly to $\sigma$, at least
in general. If, however, we are dealing with an infinitesimal
vector, the element of the tangent plane in which it lies coincides
with the element of the surface $\sigma$ around $P$, and we may say that
we are using only points lying in $\sigma$. Hence, for a generic
infinitesimal tangential vector we can use the ordinary notion of a
displacement from the origin $P$ to the final point $P_1$, where
$P_1$ also lies on $\sigma$. As the length $R$ reduces in this case to a linear
element $ds$, it follows from the definition of direction parameters
that the quantities $R^i$, which are equal to $\lambda^i\,ds$, are identical with
the increments $dx_i$ of the curvilinear co-ordinates in passing
from $P$ to $P_1$.

Next, consider two systems of differentials $dx_i$, $\delta x_i$, and the
corresponding infinitesimal vectors (or displacements) $dP = PP_1$,
$\delta P = PP_2$ (assumed to lie in $\sigma$). We shall use the symbol
$df$ to denote the increment of $f$ (where $f$ is a generic vector or
any scalar or vector quantity derived from it) corresponding
to a parallel displacement from $P$ to $P_1$; the symbol $\delta f$
will be defined in the same way for the displacement from
$P$ to $P_2$.

With this convention $d\delta P$ will represent the vectorial increment
of $\delta P$ for a displacement from $P$ to $P_1$, and $d\delta x_i$ the increment
of the associated contravariant system $\delta x_i$. For the latter,
equation (23) gives

$$d\delta x_i = -\sum_{j,l=1}^{2} \{jl, i\}\,\delta x_j\,dx_l \qquad (i = 1, 2). \tag{29}$$

Similarly, the displacement of $dP$ from $P$ to $P_2$ gives the
increments $\delta dx_i$, for which we have

$$\delta dx_i = -\sum_{j,l=1}^{2} \{jl, i\}\,dx_j\,\delta x_l. \tag{29'}$$

Interchanging $j$ and $l$ in one of these two sums, and using the
property of symmetry of Christoffel's symbols, we see that

$$d\delta x_i = \delta dx_i, \tag{30}$$

which proves that the two operators $d$ and $\delta$, as just defined, are
permutable.

The geometrical meaning of this result is particularly simple.
Note first of all that for infinitesimal vectors (the only kind
considered here) the elements of the contravariant system are
merely the differences of corresponding co-ordinates. Hence,
if the co-ordinates of $P$ are the $x_i$'s, we shall have in the first
place $x_i + dx_i$ as the co-ordinates of $P_1$, and $x_i + \delta x_i$ as the
co-ordinates of $P_2$. Let $Q$ be the point on $\sigma$ reached by constructing
at $P_1$ the vector equipollent to $\delta P$; as the contravariant
system for this vector is $\delta x_i + d\delta x_i$, we get finally

$$x_i + dx_i + \delta x_i + d\delta x_i$$

as the co-ordinates of $Q$.

Similarly let $Q^*$ be the point on $\sigma$ reached by constructing
at $P_2$ the vector equipollent to $dP$; we get the co-ordinates of
$Q^*$ by interchanging the operators $d$ and $\delta$ in the co-ordinates of
$Q$, which gives

$$x_i + \delta x_i + dx_i + \delta dx_i.$$

Applying equation (30), we see that $Q$ coincides with $Q^*$.
A more illuminating way of expressing the same result is to say
that the parallelogram rule holds for infinitesimal vectors which are
equipollent with respect to a surface.¹

It may be noted that in the foregoing argument second-order
quantities of the type $d\delta x$ have been taken into account, but
those of the type $(dx_i)^2$ have been neglected. If the latter were to be taken
into account, by considering the vectors $\delta P$, $dP$ and the equipollent
vectors at $P_1$ and $P_2$ as vectors in space, we should no
longer have a parallelogram, nor even a closed quadrilateral.
In fact, referring to the space construction already given (cf.
p. 105) for vectors equipollent with respect to a surface, we see
that while $d\delta P$ and $\delta dP$ are both in the direction of the normal
to $\sigma$ at $P$, yet their lengths are in general different, since the
three points $P$, $P_1$, $P_2$ and their respective tangent planes have
a priori no relation between them except that of being infinitely
near one another.

The formulae (29) or (29') provide a definition of the second
differentials which is invariant with respect to any change of
variables. In order to grasp the significance and value of this
fact, we must recall the conventions as to second differentials
which are adopted in the elementary theory of the calculus.

To fix the ideas, consider the simpler case of a single independent
variable. Ordinarily the convention $d^2x = 0$ is adopted;
i.e. the increments $dx$ are considered independent of $x$, as is
quite legitimate. But this simplification does not hold if we
change the independent variable by putting $x = f(\xi)$, from which,
on the hypothesis that we have a reversible transformation, we
can reciprocally find $\xi$ as a function $F(x)$ of $x$. In fact,
differentiating twice the formula $\xi = F(x)$, we get

$$d\xi = F'(x)\,dx,$$

$$d^2\xi = F''(x)\,(dx)^2 + F'(x)\,d^2x,$$

¹ This property might be taken as the starting point for an intrinsic proof of
the properties of parallelism, depending only on the metric of $\sigma$, and making no
use of the surrounding space. The method can be applied directly to manifolds
$V_n$ of any number of dimensions. Cf. H. Weyl, Raum, Zeit, Materie, § 14 (Berlin,
Springer, 1923).

which shows that even if we make $d^2x = 0$, $d^2\xi$ will not in general
be zero.
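
A one-line instance may make the point concrete. The choice $x = e^{\xi}$ is an assumption made purely for illustration and does not occur in the text:

```latex
% Assumed example: x = e^{\xi}, so that \xi = F(x) = \log x.
d\xi = \frac{dx}{x}, \qquad
d^2\xi = -\frac{(dx)^2}{x^2} + \frac{d^2x}{x} .
% With the convention d^2 x = 0 there remains
% d^2\xi = -(dx)^2 / x^2 = -(d\xi)^2, which is not zero.
```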

If then there are $n$ variables, it is usual to consider only
systems of differentials which are completely independent of
the variables $x_i$, so that we have not only $d^2x_i = 0$, but also,
for any two systems $dx_i$, $\delta x_i$ of these differentials whatever,

$$d\delta x_i = \delta dx_i = 0 \qquad (i = 1, 2, \dots, n).$$

Now change the variables, by putting $x_i = f_i(\bar{x})$, and therefore
$\bar{x}_i = F_i(x)$. Using the condition $\delta dx_i = 0$, we get

$$\delta d\bar{x}_i = \sum_{j,l=1}^{n} \frac{\partial^2 F_i}{\partial x_j\,\partial x_l}\,dx_j\,\delta x_l,$$

so that the property

$$\delta d\bar{x}_i = d\delta\bar{x}_i$$

also holds, but these differentials will not in general be zero.
The usual convention is therefore legitimate, and is suggested
by obvious reasons of simplicity, when in a given question we are
dealing always with the same variables; but it is not invariant
for a change of variables.

If instead we adopt the formulae (29) and (29'), and suppose
that

$$d\delta x_i = \delta dx_i = -\sum_{j,l=1}^{2} \{jl, i\}\,\delta x_j\,dx_l, \tag{31}$$

we get, for the same geometrical interpretation of this formula,

$$d\delta\bar{x}_i = \delta d\bar{x}_i = -\sum_{j,l=1}^{2} \overline{\{jl, i\}}\,\delta\bar{x}_j\,d\bar{x}_l,$$

where the line above the letters denotes that Christoffel's symbols
refer to the variables $\bar{x}$, i.e. to the transformed quadratic form

$$ds^2 = \sum_{i,k=1}^{2} \bar{a}_{ik}\,d\bar{x}_i\,d\bar{x}_k.$$

We could of course verify by direct substitution that the
form of the expressions (31) is unchanged by the change of
variables. We are in fact dealing with an immediate corollary
of the invariance of the equations $\tau^i = 0$ (cf. § 15), which follows
at once by putting $R^i = \delta x_i$ in these equations.

On account of this invariant property the second differentials,
defined as in (31), are called contravariant, although strictly
speaking the term applies not to them but to the expressions

$$d\delta x_i + \sum_{j,l=1}^{2} \{jl, i\}\,\delta x_j\,dx_l,$$

which in any case (see § 15) constitute a simple contravariant
system.

(c) Extension of the Foregoing Notions to n-dimensional
Manifolds of any Metric

20. n-dimensional manifolds.

Alongside the extension of the use of geometrical terms which
was developed in Chapter I, we shall now introduce, on the lines
of the discussion in subdivision (a) of this chapter, the fundamental
notion of an $n$-dimensional metric manifold, where $n$ is any
integer.

If there are $n$ variables $x_1, x_2, \dots, x_n$, we know that the aggregate
of values which can be assigned to them is called an
$n$-dimensional manifold. Now suppose that together with these
variables and their field of variation there is also given a priori
a differential quadratic form

$$ds^2 = \sum_{i,k=1}^{n} a_{ik}\,dx_i\,dx_k, \tag{32}$$

in which the coefficients $a_{ik}$ are given functions of the $x$'s, and
$a_{ik} = a_{ki}$. We shall agree to consider $ds$ as the distance between
the two infinitely near points whose co-ordinates are $x_1, x_2, \dots, x_n$
and $x_1 + dx_1$, $x_2 + dx_2$, $\dots$, $x_n + dx_n$; we shall in consequence
agree that $ds$ is to be invariant for any change of co-ordinates.
Having thus introduced into the manifold the notion of an
elementary distance, we get from it at once by integration the
notion of the length of a line, and also deduce from it, as we shall
see, the most direct criteria for defining all the properties of
extension (angles, areas, volumes, &c.).

A manifold with which has been associated a quadratic form
of the type (32), or in other words, a manifold whose metric is
given, is called a metric manifold, and will be here denoted briefly
by $V_n$. Since $ds^2$ is invariant, the coefficients $a_{ik}$ obviously form
a symmetrical covariant double system; we shall throughout
the discussion suppose that they and their first and second
derivatives are finite and continuous functions, and so chosen
as to make the quadratic form definite and positive.¹ Thus the
distance between two real points will always be real; the determinant
$a$ of the coefficients will always be positive. With the
usual notation the reciprocal elements will be denoted by $a^{ik}$,
&c.

We shall now extend the concept of direction to a generic
$V_n$. We shall consider direction as determined by two infinitely
near points, i.e. by a system of $dx_i$'s. As before, we shall apply
the term parameters to the $n$ contravariant quantities

$$\lambda^i = \frac{dx_i}{ds} \qquad (i = 1, 2, \dots, n)$$

which define a direction (and are uniquely determined by it),
and we shall apply the term moments to the covariant quantities

$$\lambda_i = \sum_{k=1}^{n} a_{ik}\,\lambda^k \qquad (i = 1, 2, \dots, n).$$

Thus for any value of $n$ we have again two simple systems,
reciprocal with respect to $a_{ik}$, or to the form (32) (cf. p. 96,
Remark III).

The parameters are connected by a relation completely
analogous to (5), and the formulae (5'), (5''), and (6') can be
extended without difficulty, the summations being now from
1 to $n$ instead of from 1 to 2. The aggregate consisting of a direction
and a positive number $R$ will be called a vector $\mathbf{R}$ in a
$V_n$ ($R$ being the magnitude of the vector); the products of $R$ by
the parameters of the direction will be called the contravariant
components $R^i$, and the products of $R$ by the moments the
covariant components $R_i$. We shall then have a set of formulae
analogous to (11), (11'), (11'').

Suppose the $x$'s expressed as regular functions (i.e. finite and
continuous, together with all their derivatives which enter the

¹ At the end of the chapter (p. 141) we shall also consider shortly the case of
an indefinite quadratic form. This case was at first neglected as offering little
likelihood of useful application, but the theory of relativity has now invested
it with very great importance.

discussion, in the field considered) of $p$ parameters $u_1, u_2, \dots, u_p$,
where $p$ is a positive integer less than $n$:

$$x_i = f_i(u_1, u_2, \dots, u_p) \qquad (i = 1, 2, \dots, n). \tag{33}$$

We shall make the hypothesis that at least one set of $p$ functions
$f$ is independent, i.e. that $p$ is the characteristic of the functional
matrix of the $f$'s with respect to the $u$'s. Hence the $x$'s
are connected by $n - p$ relations and no more, namely, those
which we should get by eliminating the $u$'s from equations (33).
In this way we define a subordinate $p$-dimensional manifold
$W_p$, whose co-ordinates are the $u$'s. $W_p$ is said to be contained or
immersed in $V_n$, since to every system of $p$ values assigned to
the $u$'s there corresponds, by (33), a system of $n$ values assigned
to the $x$'s (i.e. every point of $W_p$ belongs to $V_n$), while not all
the systems of values which can be assigned to the $x$'s satisfy
the equations (33) (i.e. not all the points of $V_n$ belong to $W_p$).
Now, remembering the analogy with the case $n = 3$, $p = 2$
(cf. p. 87), we naturally assign to the distance between two points
of the subordinate variety the same value (32) as that of the
distance between the same two points when they are considered
as belonging to $V_n$; i.e. we construct $ds^2$ for the subordinate
manifold by substituting in (32) for the $dx$'s their values obtained
by differentiating the equations (33). In this way we can easily
find the coefficients of the fundamental quadratic form in the
$du$'s, and the metric of the $p$-dimensional manifold $W_p$, immersed
in $V_n$, will be completely defined. For $p = 1$ the definition
coincides with that given in Chapter I, § 1, for a line, of which the
equations (33) are the parametric equations.

If $p = n - 1$, the $W_p$ is often called a surface, or more properly
a hypersurface.
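
The construction of the subordinate metric can be sketched in a few lines of Python. Everything specific in the sketch is an assumption made for illustration: the containing manifold is taken to be Euclidean 3-space in Cartesian co-ordinates, the equations (33) are those of the unit sphere, and the derivatives are estimated by central differences; the function names are hypothetical.

```python
import math

def embedding(u):
    """Assumed instance of equations (33): the unit sphere, u = (colatitude, longitude)."""
    theta, phi = u
    return [math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta)]

def ambient_metric(x):
    """Coefficients a_ik of the containing V_n; here the Euclidean form (unit matrix)."""
    n = len(x)
    return [[1.0 if i == k else 0.0 for k in range(n)] for i in range(n)]

def jacobian(f, u, h=1e-6):
    """J[i][r] approximates the derivative of x_i with respect to u_r (central differences)."""
    columns = []
    for r in range(len(u)):
        up, um = list(u), list(u)
        up[r] += h
        um[r] -= h
        fp, fm = f(up), f(um)
        columns.append([(p - m) / (2 * h) for p, m in zip(fp, fm)])
    n = len(columns[0])
    return [[columns[r][i] for r in range(len(u))] for i in range(n)]

def induced_metric(f, u):
    """b_rs = sum over i, k of a_ik (dx_i/du_r)(dx_k/du_s): the ds^2 of W_p in the du's."""
    a = ambient_metric(f(u))
    J = jacobian(f, u)
    n, p = len(J), len(u)
    return [[sum(a[i][k] * J[i][r] * J[k][s] for i in range(n) for k in range(n))
             for s in range(p)] for r in range(p)]

b = induced_metric(embedding, [math.pi / 3, 0.4])
print(b)
```

For this example the coefficients come out, to within the error of the numerical differentiation, as those of $du_1^2 + \sin^2 u_1\,du_2^2$, the familiar linear element of the sphere.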

21. Euclidean manifolds. Any F,^ can always be considered 
as immersed in a Euclidean space. 

If ds"^ reduces to the sum of the squares of the differentials, 
as in the case of orthogonal Cartesian co-ordinates, the quadratic 
form is said to be Euclidean^ and the co-ordinates, by an obvious 
analogy with the elementary cases n — 2 and n = 3, are called 
orthogonal Cartesian co-ordinaZes, When tliis is so, all Christoffers 
symbols obviously vanish identically, since the coefficients 

(D655) A* 
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are constants. Given a generic F„, and therefore a generic 
it is not in general possible to bring about a change of variables 
such that ds^ takes the Euclidean form, or in other words to 
establish a system of Cartesian co-ordinates in F,,; if it is possible 
F„ is called a Euclidean manifold^ and we shall denote it by 
S,^. We shall find later on the conditions to be satisfied by the 
a^*.’s in order that may be Euclidean. F,,, however, can always 
be considered as immersed in an iNT-dimensional Euclidean variety, 
where IV > n, as we shall now show. 

We propose to determine N functions of the a?’s, 

such that when we differentiate them and take the sum of the 
squares of the differentials we get a form, quadratic in the <Zip’s, 
which is identical with the given ds^, so that we have identically 

IV n 

I 1 

Expressing the dy’s in terms of the dx’a, we have 

Si* dXi dx^ i:,* a,,, dx^ dx^ 

I 1 dxi dXf, 1 

or, equating the coefficients of dx^ dxj^, 

(*■’ * ^ 2 n). (35) 

1 O Xf^ O Xj^ 

We have thus obtained $\tfrac{1}{2}n(n+1)$ partial differential equations of the first order in the $N$ unknowns $y$; unless any of these are mutually inconsistent (and a more detailed discussion would show that this is not so) we deduce that the problem is soluble for $N = \tfrac{1}{2}n(n+1)$, and a fortiori for $N > \tfrac{1}{2}n(n+1)$. The $y$'s can evidently be considered as Cartesian co-ordinates in a Euclidean manifold (space) in which the given $V_n$ is immersed, $V_n$ being parametrically represented by the values of the $y$'s in (34) (cf. formula (33)). It is therefore possible to immerse a generic $V_n$ in a Euclidean space $S_N$ provided $N \ge \tfrac{1}{2}n(n+1)$. For particular $V_n$'s, however, a smaller number of dimensions may suffice; e.g. for a Euclidean $V_n$, $n$ dimensions are sufficient; in this case the $y$'s are Cartesian co-ordinates of $V_n$ itself.
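As an illustration of (35), the following sketch takes the ordinary unit sphere ($n = 2$, $N = 3$) and recovers the coefficients $a_{ik}$ numerically from the derivatives of the $y$'s. The choice of the sphere, the function names and the finite-difference step are assumptions made for the example only; they are not part of the text.

```python
import math

def embedding(theta, phi):
    """Cartesian co-ordinates y_1, y_2, y_3 of a point of the unit sphere
    (an assumed example of the functions (34))."""
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))

def induced_metric(x, h=1e-6):
    """a_ik = sum over nu of (dy_nu/dx_i)(dy_nu/dx_k), as in (35).

    The partial derivatives are approximated by central differences."""
    n = len(x)
    jac = []  # jac[i][nu] approximates dy_nu / dx_i
    for i in range(n):
        forward = list(x)
        backward = list(x)
        forward[i] += h
        backward[i] -= h
        y_plus = embedding(*forward)
        y_minus = embedding(*backward)
        jac.append([(p - m) / (2.0 * h) for p, m in zip(y_plus, y_minus)])
    return [[sum(jac[i][nu] * jac[k][nu] for nu in range(len(jac[i])))
             for k in range(n)]
            for i in range(n)]

# Coefficients at the point theta = 1.0, phi = 0.5.
a = induced_metric((1.0, 0.5))
```

For the sphere the result should agree, up to the discretization error, with the familiar $ds^2 = d\theta^2 + \sin^2\theta\, d\varphi^2$.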

If $N$ has the smallest possible value, the difference $N - n$ is called the class of the $V_n$. Since the minimum $N$ is not greater than $\tfrac{1}{2}n(n+1)$, the class cannot be greater than $\tfrac{1}{2}n(n+1) - n$, or $\tfrac{1}{2}n(n-1)$. Further, $N$ cannot be less than $n$,¹ and therefore the minimum value of the class is 0. For $n = 2$, the class cannot exceed 1, which shows that every binary $ds^2$ may be considered as belonging to an ordinary surface. In other words, the parametric expressions which were our starting point (p. 86) impose no restrictions on the study of the intrinsic properties of a $ds^2$ in two variables.
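The counting argument above can be tabulated directly. The sketch below only evaluates the two bounds stated in the text; the function names are illustrative.

```python
def sufficient_embedding_dimension(n):
    """Number N = n(n+1)/2 of Euclidean dimensions that always suffices."""
    return n * (n + 1) // 2

def maximum_class(n):
    """Upper bound n(n-1)/2 for the class N - n of a V_n."""
    return sufficient_embedding_dimension(n) - n

# Bounds for n = 1, ..., 4 as (n, N, maximum class).
table = [(n, sufficient_embedding_dimension(n), maximum_class(n))
         for n in range(1, 5)]
```

For $n = 2$ the bound on the class is 1, in agreement with the remark on binary forms; for $n = 4$ it is 6, with $N = 10$.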

22. Angular metric. 

We shall now extend to the generic $V_n$ the notion of the angle between two directions. The most direct method is by the formal extension of formula (8) (and its equivalents) by summing from 1 to $n$ instead of from 1 to 2; this however will be legitimate, if we wish to avoid imaginary values of $\vartheta$, only when we have shown that the expression on the right does not exceed 1 in absolute value.

In order to do this, we shall examine some algebraic properties 
of quadratic forms. 

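Let

$$\phi_{zz} = \sum_{i,k=1}^{n} a_{ik}\, z_i\, z_k$$

be a definite positive quadratic form. Suppose that the $z$'s are linear combinations of two different systems of non-proportional variables, so that we may put

$$z_i = \lambda x_i + \mu y_i ;$$

we therefore have

$$\phi_{zz} = \sum_{i,k=1}^{n} a_{ik} (\lambda x_i + \mu y_i)(\lambda x_k + \mu y_k) = \sum_{i,k=1}^{n} a_{ik} \left[ \lambda^2 x_i x_k + \lambda\mu\, (x_i y_k + y_i x_k) + \mu^2 y_i y_k \right].$$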

¹ A quadratic form $\phi = \sum_{i,k=1}^{n} a_{ik}\, \xi_i\, \xi_k$ is called irreducible when the number of independent variables cannot be reduced by substituting for the $\xi$'s linear combinations of them. This is always so when the form is definite, as in this case the determinant $a$ of the coefficients is certainly not zero (p. 90). $N$ cannot therefore be less than $n$.

Splitting up the right-hand side into three sums, and putting

$$\sum_{i,k=1}^{n} a_{ik}\, x_i\, x_k = \phi_{xx}, \qquad \sum_{i,k=1}^{n} a_{ik}\, x_i\, y_k = \sum_{i,k=1}^{n} a_{ik}\, y_i\, x_k = \phi_{xy}, \qquad \sum_{i,k=1}^{n} a_{ik}\, y_i\, y_k = \phi_{yy},$$

we have finally

$$\phi_{zz} = \lambda^2 \phi_{xx} + 2\lambda\mu\, \phi_{xy} + \mu^2 \phi_{yy}. \tag{36}$$

This may be considered as a quadratic form in $\lambda$ and $\mu$; it is easy to show that it is definite and positive, i.e. that it is always greater than 0 when $\lambda$ and $\mu$ are not both zero. In fact, $\phi_{zz}$, considered as a quadratic form in the $z$'s, is always positive, provided at least one of the $z$'s is not zero; and this condition is equivalent to our hypothesis that the $x$'s and $y$'s are not proportional.

From (36) we therefore get

$$\lambda^2 \phi_{xx} + 2\lambda\mu\, \phi_{xy} + \mu^2 \phi_{yy} > 0,$$

whatever $\lambda$ and $\mu$ (not both zero) may be.

Hence, from an ordinary property of quadratic inequalities, we get

$$\phi_{xx}\, \phi_{yy} - \phi_{xy}^2 > 0, \tag{37}$$

which is the formula we wished to prove.

We now return to the proposed formal extension of formula (8). What we have to prove is that

$$\left| \sum_{i,k=1}^{n} a_{ik}\, \lambda^i \mu^k \right| < 1,$$

i.e. that

$$1 - \left( \sum_{i,k=1}^{n} a_{ik}\, \lambda^i \mu^k \right)^2 > 0,$$

whatever $\lambda^i$ and $\mu^i$ may be, provided they are not proportional (since we exclude the obvious case where the directions coincide or are opposite).

This inequality can now be proved at once. Introducing the quadratic relations between the parameters, we can write it in the form

$$\sum_{i,k=1}^{n} a_{ik}\, \lambda^i \lambda^k \cdot \sum_{i,k=1}^{n} a_{ik}\, \mu^i \mu^k - \left( \sum_{i,k=1}^{n} a_{ik}\, \lambda^i \mu^k \right)^2 > 0 ;$$

and this is merely (37), with the $x$'s and $y$'s replaced by the $\lambda^i$'s and $\mu^i$'s.

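We may therefore assume

$$\cos\vartheta = \sum_{i,k=1}^{n} a_{ik}\, \lambda^i \mu^k, \tag{38}$$

and the other expressions equivalent to it will also hold good, namely,

$$\cos\vartheta = \sum_{i=1}^{n} \lambda^i \mu_i, \tag{38'}$$

$$\cos\vartheta = \sum_{i=1}^{n} \lambda_i\, \mu^i, \tag{38''}$$

$$\cos\vartheta = \sum_{i,k=1}^{n} a^{ik}\, \lambda_i\, \mu_k, \tag{38'''}$$

in which the moments (cf. § 20) of one or both directions take the place of the corresponding parameters.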

In the provisionally excluded case of two coincident or opposite directions ($\lambda^i = \pm \mu^i$), we must naturally agree that $\cos\vartheta = \pm 1$.

With this convention the four formulae just given still hold good; the right-hand side in each case also reducing to $\pm 1$ in virtue of the fundamental relations

$$\sum_{i,k=1}^{n} a_{ik}\, \lambda^i \lambda^k = \sum_{i=1}^{n} \lambda_i \lambda^i = \sum_{i,k=1}^{n} a^{ik}\, \lambda_i \lambda_k = 1$$

between the parameters and moments of a single direction (cf. § 20).
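A small numerical check of the four equivalent expressions (38) to (38''') may help to fix the ideas. The coefficients $a_{ik}$ below are arbitrary illustrative values forming a definite positive form for $n = 2$; the function names are likewise assumptions of the sketch.

```python
import math

# Assumed positive-definite coefficients a_ik for n = 2 (illustrative values).
A = [[2.0, 1.0],
     [1.0, 3.0]]
DET = A[0][0] * A[1][1] - A[0][1] * A[1][0]
# Reciprocal coefficients a^{ik}, i.e. the inverse matrix.
A_INV = [[A[1][1] / DET, -A[0][1] / DET],
         [-A[1][0] / DET, A[0][0] / DET]]

def parameters(dx):
    """Parameters lambda^i = dx_i / ds of the direction of the displacement dx."""
    ds = math.sqrt(sum(A[i][k] * dx[i] * dx[k]
                       for i in range(2) for k in range(2)))
    return [c / ds for c in dx]

def moments(lam):
    """Moments lambda_i = sum over k of a_ik lambda^k."""
    return [sum(A[i][k] * lam[k] for k in range(2)) for i in range(2)]

def cos_angle_all_forms(lam, mu):
    """Right-hand sides of (38), (38'), (38'') and (38''') in that order."""
    lam_lo, mu_lo = moments(lam), moments(mu)
    f38 = sum(A[i][k] * lam[i] * mu[k] for i in range(2) for k in range(2))
    f38a = sum(lam[i] * mu_lo[i] for i in range(2))
    f38b = sum(lam_lo[i] * mu[i] for i in range(2))
    f38c = sum(A_INV[i][k] * lam_lo[i] * mu_lo[k]
               for i in range(2) for k in range(2))
    return f38, f38a, f38b, f38c

lam = parameters([1.0, 0.0])   # direction of the co-ordinate line 1
mu = parameters([0.0, 1.0])    # direction of the co-ordinate line 2
values = cos_angle_all_forms(lam, mu)
```

The four values agree with one another and lie between $-1$ and $+1$, as the inequality (37) requires; for coincident directions each of them reduces to $+1$.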

Now consider our $V_n$ immersed in a Euclidean space $S_N$. Given two directions $\boldsymbol{\lambda}$, $\boldsymbol{\mu}$ belonging to $V_n$, and drawn from the same point, the angle between them is defined in two ways, since the directions $\boldsymbol{\lambda}$, $\boldsymbol{\mu}$ may be specified either by their parameters $\lambda^i$, $\mu^i$ relative to $V_n$, or by their parameters $\lambda'^{\nu}$, $\mu'^{\nu}$ relative to $S_N$,¹ and formula (38) may be applied to either set. We shall call the angle calculated in the first way $\vartheta$ and in the second $\vartheta'$, and we shall show that $\cos\vartheta = \cos\vartheta'$.

¹ We may note in passing that in a Euclidean space, referred to rectangular Cartesian co-ordinates, the parameters of a direction coincide with its moments; also (as follows directly from the properties of linear orthogonal substitutions) the formulae of covariance are identical with those of contravariance (cf. § 3, p. 67).

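Remembering that for the Cartesian co-ordinates $y$ of $S_N$, $ds^2$ has the form $\sum_{\nu} dy_\nu^2$, we have

$$\cos\vartheta = \sum_{i,k=1}^{n} a_{ik}\, \lambda^i \mu^k, \qquad \cos\vartheta' = \sum_{\nu=1}^{N} \lambda'^{\nu} \mu'^{\nu}.$$

Now the parameters $\lambda'^{\nu}$, $\mu'^{\nu}$ are given by the formulae analogous to (7), (7')

$$\lambda'^{\nu} = \frac{dy_\nu}{ds} = \sum_{i=1}^{n} \frac{\partial y_\nu}{\partial x_i}\, \lambda^i, \qquad \mu'^{\nu} = \sum_{k=1}^{n} \frac{\partial y_\nu}{\partial x_k}\, \mu^k .$$

We have therefore

$$\cos\vartheta' = \sum_{\nu=1}^{N} \sum_{i,k=1}^{n} \frac{\partial y_\nu}{\partial x_i} \frac{\partial y_\nu}{\partial x_k}\, \lambda^i \mu^k = \sum_{i,k=1}^{n} \lambda^i \mu^k \sum_{\nu=1}^{N} \frac{\partial y_\nu}{\partial x_i} \frac{\partial y_\nu}{\partial x_k},$$

and therefore by (35)

$$\cos\vartheta' = \sum_{i,k=1}^{n} a_{ik}\, \lambda^i \mu^k = \cos\vartheta .$$

Q.E.D.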


Now consider two vectors $\mathbf{R}$, $\mathbf{V}$, whose directions are $\boldsymbol{\lambda}$, $\boldsymbol{\mu}$ respectively. We can extend the definition of the scalar product by giving this name to the invariant

$$\mathbf{R} \times \mathbf{V} = RV \cos\vartheta,$$

where $\vartheta$ is the angle between the two directions. For each of the various expressions for $\cos\vartheta$ we shall get a corresponding expression for the scalar product by multiplying (38), (38'), (38''), (38''') in turn by $RV$. The resulting formulae are:

$$\mathbf{R} \times \mathbf{V} = \sum_{i,k=1}^{n} a_{ik}\, R^i V^k = \sum_{i=1}^{n} R^i V_i = \sum_{i=1}^{n} R_i V^i = \sum_{i,k=1}^{n} a^{ik}\, R_i V_k .$$

If one of the vectors, say $\mathbf{V}$, is of unit length, we shall call the product $\mathbf{R} \times \mathbf{V}$ the projection of the vector $\mathbf{R}$ on the direction determined by the unit vector $\mathbf{V}$, or its component in this direction.

The orthogonality of two non-zero vectors is evidently expressed by the vanishing of their scalar product.

We can now make some useful remarks relating to certain particular directions. Let $\mathbf{s}_i$ denote¹ the direction of the co-ordinate line $i$ (i.e. the unit vector in that direction, in the sense of the $x_i$'s increasing); remembering that for a displacement along the line $i$ we have $dx_r = 0$ for $r \neq i$ and $ds = \sqrt{a_{ii}}\, dx_i$, we see that the parameters $s_i^r$ of the direction $\mathbf{s}_i$ are all zero except the $i$th, so that we have

$$s_i^r = 0 \quad (r \neq i), \qquad s_i^i = \frac{1}{\sqrt{a_{ii}}} .$$

On the other hand, the direction $\mathbf{n}_j$ of the normal to the co-ordinate hypersurface $x_j = \text{constant}$ (the normal meaning the direction perpendicular to any direction drawn on the hypersurface) has its moments $n_{j|i}$ all zero except the $j$th. For $\mathbf{n}_j$ must be perpendicular to each of the $n - 1$ directions $\mathbf{s}_i$ ($i \neq j$), so that applying formula (38') to the values just found for the parameters of $\mathbf{s}_i$, we can write

$$\sum_{r=1}^{n} s_i^r\, n_{j|r} = \frac{n_{j|i}}{\sqrt{a_{ii}}} = 0,$$

whence $n_{j|i} = 0$ for $i \neq j$. The value of $n_{j|j}$ is therefore determined by the quadratic identity between the moments, which gives

$$n_{j|j} = \frac{1}{\sqrt{a^{jj}}},$$

if we suppose that the sense of $\mathbf{n}_j$ is that of the $x_j$'s increasing; for the opposite sense the radical must have the minus sign.

That the direction so defined (at a generic point) is actually perpendicular to any direction $\boldsymbol{\lambda}$ (through the same point) on the hypersurface $x_j = \text{constant}$, follows from the fact that for every such direction the parameter $\lambda^j$ is zero, and therefore

$$\sum_{i=1}^{n} n_{j|i}\, \lambda^i = 0 .$$

¹ The suffix $i$ is not of course an index of covariance.





Applying the above remarks, it can at once be seen that the angle $\omega_{ik}$ between the co-ordinate lines $i$ and $k$ is given by the formula

$$\cos\omega_{ik} = \frac{a_{ik}}{\sqrt{a_{ii}\, a_{kk}}},$$

while the angle $\omega'_{ik}$ between the co-ordinate hypersurfaces $x_i = \text{constant}$ and $x_k = \text{constant}$ (i.e. the angle between their normals $\mathbf{n}_i$ and $\mathbf{n}_k$) is given by

$$\cos\omega'_{ik} = \frac{a^{ik}}{\sqrt{a^{ii}\, a^{kk}}} .$$

These formulae show the real meaning of the coefficients of $ds^2$, and the geometrical interpretation to be given to their vanishing.

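We shall now try to find the geometrical meaning of the covariant and contravariant components of a vector $\mathbf{R}$. For this purpose we shall calculate the orthogonal projections of $\mathbf{R}$ on the directions $\mathbf{s}_i$ and $\mathbf{n}_i$. We get for these

$$\mathbf{R} \times \mathbf{s}_i = \sum_{r=1}^{n} R_r\, s_i^r = \frac{R_i}{\sqrt{a_{ii}}},$$

$$\mathbf{R} \times \mathbf{n}_i = \sum_{r=1}^{n} R^r\, n_{i|r} = \frac{R^i}{\sqrt{a^{ii}}} .$$

These results show that $R_i$ and $R^i$ represent the projections of the vector $\mathbf{R}$ on the co-ordinate direction $i$ and on the normal to the co-ordinate hypersurface $x_i = \text{constant}$, multiplied by $\sqrt{a_{ii}}$ and $\sqrt{a^{ii}}$ respectively.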
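The geometrical meaning of the two kinds of components can be checked numerically. In the sketch below the coefficients $a_{ik}$ and the vector $\mathbf{R}$ are arbitrary illustrative values for $n = 2$, and all the function names are assumptions of the example.

```python
import math

N = 2
# Assumed positive-definite coefficients a_ik (illustrative values).
A = [[2.0, 1.0],
     [1.0, 3.0]]
DET = A[0][0] * A[1][1] - A[0][1] * A[1][0]
A_INV = [[A[1][1] / DET, -A[0][1] / DET],
         [-A[1][0] / DET, A[0][0] / DET]]

def scalar_product(u, v):
    """R x V = sum of a_ik u^i v^k for contravariant components u, v."""
    return sum(A[i][k] * u[i] * v[k] for i in range(N) for k in range(N))

def lower(v):
    """Covariant components v_i = sum over k of a_ik v^k."""
    return [sum(A[i][k] * v[k] for k in range(N)) for i in range(N)]

def raise_index(w):
    """Contravariant components w^i = sum over k of a^{ik} w_k."""
    return [sum(A_INV[i][k] * w[k] for k in range(N)) for i in range(N)]

def coordinate_direction(i):
    """Parameters of s_i: all zero except the i-th, equal to 1/sqrt(a_ii)."""
    s = [0.0] * N
    s[i] = 1.0 / math.sqrt(A[i][i])
    return s

def hypersurface_normal(j):
    """Parameters of n_j, built from moments that vanish except the j-th."""
    moments = [0.0] * N
    moments[j] = 1.0 / math.sqrt(A_INV[j][j])
    return raise_index(moments)

R = [0.7, -1.2]          # contravariant components R^i (illustrative)
R_cov = lower(R)         # covariant components R_i
on_lines = [scalar_product(R, coordinate_direction(i)) for i in range(N)]
on_normals = [scalar_product(R, hypersurface_normal(i)) for i in range(N)]
cos_omega = scalar_product(coordinate_direction(0), coordinate_direction(1))
```

Multiplying `on_lines[i]` by $\sqrt{a_{ii}}$ reproduces $R_i$, and multiplying `on_normals[i]` by $\sqrt{a^{ii}}$ reproduces $R^i$; `cos_omega` is the cosine of the angle between the two co-ordinate lines.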


23. Definition of geodesics. 

We shall fix any two points $A$, $B$ in a generic $V_n$, and we shall try to find the shortest of the lines which join $A$ and $B$. In a certain sense this problem is analogous to that of finding the points at which a function is a maximum or minimum, the solution of which is of course an important application of the calculus. Here, however, we are not trying to find points, and hence the values of one (or in general of $n$) variables which satisfy the required condition; we are trying to determine a line, and hence, analytically, to determine the form of $n$ functions (the parametric equations of the line). The problem is therefore of a higher order of difficulty: while the former led to equations in finite terms, the latter leads to differential equations. The solution of this problem and of others related to it is the principal object of the calculus of variations. We shall recall shortly the fundamental idea of this calculus, which does not differ in principle from the idea which leads to the solution of the other more elementary problem of the maxima and minima of a given function.

To fix the ideas, we shall suppose that there is only one variable. We know that if a function $f(x)$ has a maximum or minimum at $x_0$, its differential $df = f'(x_0)\, dx$ is zero at that point (and therefore $f'(x_0) = 0$), whatever $dx$ may be; in other words, for an infinitely small displacement to left or right from the point $x_0$, $f$ remains constant (except for infinitesimals of higher order). This can also be seen intuitively from the graphical representation of the function (cf. the points $M$ and $N$ in the diagram). The converse, however, is not true, i.e. when $df = 0$ it does not necessarily follow that there is a maximum or minimum (cf. for instance the point $P$ in the diagram). The maxima and minima must be looked for among the points where $df = 0$.

Let us now see how we can apply this method to the determination of the shortest line joining $A$ and $B$, without going outside a given $V_n$ (we may think of a line drawn on a surface, i.e. the case $n = 2$). Let $g$ be such a line; draw a line $g'$, having the same extremities as $g$, and infinitely near $g$, but otherwise completely arbitrary. We can consider $g'$ as derived from $g$ by an infinitesimal deformation, i.e. by displacing each point $(x_1, x_2, \dots, x_n)$ of $g$ to $(x_1 + \delta x_1, x_2 + \delta x_2, \dots, x_n + \delta x_n)$. If $g$ is the shortest of these lines, its length is not changed¹ by this deformation (except for infinitesimals of higher order); hence if $l$ is the length of $g$ and $l + \delta l$ that of $g'$, we have

$$\delta l = 0, \tag{39}$$

whatever $g'$ may be (subject only to the conditions imposed above), a condition analogous to the vanishing of $df$ in the former case. Here too, however, it is to be noted that in general the condition (39) can be satisfied not only by the required line but also by other lines which do not give the shortest path from $A$ to $B$.

For instance, let $A$ and $B$ be on the same generator of a cylinder. Then the shortest path is evidently given by the generator, which, as can easily be seen, satisfies (39). But all the infinite number of helices which pass from $A$ to $B$, wrapping themselves 1, 2, . . . times round the cylinder, also satisfy the same equation.
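The lengths of these competing lines are easily compared if the cylinder is developed on a plane, where each helix becomes a straight segment. The sketch below assumes a circular cylinder of given radius and two points at a given distance apart on one generator; the function name and the numerical values are illustrative only.

```python
import math

def helix_length(height, radius, turns):
    """Length of the helix joining two points of one generator, `height`
    apart, after winding `turns` times round a circular cylinder.

    On the developed cylinder the helix is the hypotenuse of a right-angled
    triangle with sides `height` and `turns` circumferences."""
    return math.hypot(height, 2.0 * math.pi * radius * turns)

# Generator (0 turns) and the helices with 1, 2, 3 turns.
lengths = [helix_length(1.0, 1.0, k) for k in range(4)]
```

Every one of these lines satisfies (39), but only the generator (zero turns) gives the shortest path.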

We shall call all the lines which satisfy condition (39) geodesics. They possess important characteristic properties, which can be deduced from (39); e.g. the osculating plane at any point of a geodesic on a surface is normal to the tangent plane to the surface, which is the property adopted on p. 103 as the definition of a geodesic. The lines of minimum length between two given points must be looked for among the geodesics through the two points.

This is the definition which we shall use below; but it is to be noted that some writers in defining geodesics start from another property. We could in fact show that when a point $A$ is fixed on a geodesic $g$, then for all points $B$ (on $g$) sufficiently near $A$, $g$ is the only geodesic joining $A$ and $B$, and is therefore the shortest line joining them. Hence we can also say that a geodesic is a line such that it forms the shortest path between any two of its points, provided they are sufficiently close together.

With this restriction the two definitions are equivalent.

¹ For a more rigorous and complete discussion the reader is referred to treatises on analysis.

24. Differential equations of geodesics. 

We shall now examine the property, concisely expressed by the equation (39), that the length remains unchanged in an infinitesimal displacement which does not move the extremities, and see how to express it by means of $n$ differential equations which the $n$ functions

$$x_i = x_i(s) \qquad (i = 1, 2, \dots, n)$$

defining the curve $g$ must satisfy.

Let the equations of $g'$ be

$$x_i = x_i(s) + \delta x_i \qquad (i = 1, 2, \dots, n),$$

where the $\delta x$'s are to be considered as infinitesimal functions of $s$, vanishing for $s = 0$ and $s = l$, and having finite and continuous first and second derivatives, but otherwise arbitrary.

Take an infinitesimal segment $PP_1$ of $g$, of length $ds$; we have to calculate $\delta ds$, i.e. the increment (or, as it is called, the variation) of $ds$ in the deformation which displaces $P$ to $P'$ and $P_1$ to $P_1'$. If $dx_i$ ($i = 1, 2, \dots, n$) is the difference between the co-ordinates of $P$ and of $P_1$, the corresponding difference after the deformation, which we shall denote by $d(x_i + \delta x_i) = dx_i + d\delta x_i$ (where $d\delta x_i$ is of course the differential of the function $\delta x_i$), is calculated as follows:

The co-ordinates of $P'$ are $x_i + \delta x_i$;
those of $P_1'$ are $(x_i + dx_i) + \delta(x_i + dx_i)$;
therefore the required difference is $dx_i + \delta dx_i$.

It follows that

$$\delta dx_i = d\delta x_i, \tag{40}$$

a result which we shall at once make use of.

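We shall now take the expression for $ds^2$ and calculate its variation, differentiating with the operator $\delta$. We have

$$2\, ds\, \delta ds = \sum_{j,k=1}^{n} \delta a_{jk}\, dx_j\, dx_k + \sum_{j,k=1}^{n} a_{jk}\, dx_k\, \delta dx_j + \sum_{j,k=1}^{n} a_{jk}\, dx_j\, \delta dx_k .$$

Since the last two sums are identical, we can write this in the form (using equation (40))

$$2\, ds\, \delta ds = \sum_{j,k=1}^{n} \delta a_{jk}\, dx_j\, dx_k + 2 \sum_{j,k=1}^{n} a_{jk}\, dx_j\, d\delta x_k .$$

Dividing by $2\, ds$, and denoting differentiation with respect to $s$ by a dot, this gives

$$\delta ds = \tfrac{1}{2} \sum_{j,l=1}^{n} \delta a_{jl}\, \dot{x}_j\, \dot{x}_l\, ds + \sum_{j,k=1}^{n} a_{jk}\, \dot{x}_j\, d\delta x_k ;$$

and from this, since

$$\delta a_{jl} = \sum_{k=1}^{n} \frac{\partial a_{jl}}{\partial x_k}\, \delta x_k,$$

we get $\delta ds$ in the form

$$\delta ds = \tfrac{1}{2} \sum_{j,k,l=1}^{n} \frac{\partial a_{jl}}{\partial x_k}\, \dot{x}_j\, \dot{x}_l\, \delta x_k\, ds + \sum_{j,k=1}^{n} a_{jk}\, \dot{x}_j\, d\delta x_k .$$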

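Now since the length of $g$ is

$$l = \int_A^B ds,$$

the variation which is to be equated to 0 in (39) is

$$\delta l = \int_A^B \delta ds,$$

or

$$\delta l = \int_A^B \tfrac{1}{2} \sum_{j,k,l=1}^{n} \frac{\partial a_{jl}}{\partial x_k}\, \dot{x}_j\, \dot{x}_l\, \delta x_k\, ds + I, \tag{41}$$

where we have put

$$I = \int_A^B \sum_{j,k=1}^{n} a_{jk}\, \dot{x}_j\, d\delta x_k . \tag{42}$$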

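We shall leave (41) aside for a moment and examine the possibility of transforming the integral in (42) also into a form which explicitly contains the arbitrary variations $\delta x_k$. Integrating by parts we get

$$I = \left[ \sum_{j,k=1}^{n} a_{jk}\, \dot{x}_j\, \delta x_k \right]_A^B - \int_A^B \sum_{j,k=1}^{n} \delta x_k\, d(a_{jk}\, \dot{x}_j) .$$

The integrated part vanishes, since $\delta x_k = 0$ at the extremities; differentiating the product, the other part gives

$$I = - \int_A^B \sum_{j,k=1}^{n} a_{jk}\, \ddot{x}_j\, \delta x_k\, ds - \int_A^B \sum_{j,k=1}^{n} da_{jk}\, \dot{x}_j\, \delta x_k . \tag{42'}$$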
A 1 ' ^ 1 

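Expanding $da_{jk}$, the sum under the second integral sign may be written

$$\sum_{j,k,l=1}^{n} \frac{\partial a_{jk}}{\partial x_l}\, \dot{x}_j\, \dot{x}_l\, \delta x_k\, ds,$$

or, interchanging $j$ and $l$,

$$\sum_{j,k,l=1}^{n} \frac{\partial a_{lk}}{\partial x_j}\, \dot{x}_j\, \dot{x}_l\, \delta x_k\, ds .$$

We shall take half the sum of these two expressions to represent the value of either. Substituting in (42'), we get

$$I = - \int_A^B \sum_{j,k=1}^{n} a_{jk}\, \ddot{x}_j\, \delta x_k\, ds - \int_A^B \tfrac{1}{2} \sum_{j,k,l=1}^{n} \left( \frac{\partial a_{jk}}{\partial x_l} + \frac{\partial a_{lk}}{\partial x_j} \right) \dot{x}_j\, \dot{x}_l\, \delta x_k\, ds .$$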

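We now return to (41) and insert in it this expression for $I$. Putting all the terms under a single integral sign, and taking out the common factor $\delta x_k\, ds$, we get

$$\delta l = - \int_A^B \sum_{k=1}^{n} \left[ - \tfrac{1}{2} \sum_{j,l=1}^{n} \frac{\partial a_{jl}}{\partial x_k}\, \dot{x}_j\, \dot{x}_l + \sum_{j=1}^{n} a_{jk}\, \ddot{x}_j + \tfrac{1}{2} \sum_{j,l=1}^{n} \left( \frac{\partial a_{jk}}{\partial x_l} + \frac{\partial a_{lk}}{\partial x_j} \right) \dot{x}_j\, \dot{x}_l \right] \delta x_k\, ds,$$

or, remembering the definition of Christoffel's symbols,

$$\delta l = - \int_A^B \sum_{k=1}^{n} \left\{ \sum_{j=1}^{n} a_{jk}\, \ddot{x}_j + \sum_{j,l=1}^{n} [jl, k]\, \dot{x}_j\, \dot{x}_l \right\} \delta x_k\, ds .$$

Putting

$$p_k = \sum_{j=1}^{n} a_{jk}\, \ddot{x}_j + \sum_{j,l=1}^{n} [jl, k]\, \dot{x}_j\, \dot{x}_l, \tag{43}$$

the formula appears in the concise form

$$\delta l = - \int_A^B \sum_{k=1}^{n} p_k\, \delta x_k\, ds . \tag{44}$$

The result of all these calculations is that (39) can be written in the form

$$\int_A^B \sum_{k=1}^{n} p_k\, \delta x_k\, ds = 0 . \tag{39'}$$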


Now since (39') must hold however the arbitrary functions $\delta x_k$ are chosen (subject to the qualitative conditions stated above), we must necessarily have

$$p_k = 0 \qquad (k = 1, 2, \dots, n); \tag{45}$$

for if not, we need only take each $\delta x_k$ with the same sign as the corresponding $p_k$ (which can be done without going outside the conditions imposed); the sum would then certainly be positive at all points of the arc of integration and therefore the integral would not vanish. This is the fundamental argument in the calculus of variations; by means of it we get from (39') (which is only (39) expanded) the $n$ differential equations (45) which written at length are

$$\sum_{j=1}^{n} a_{jk}\, \ddot{x}_j + \sum_{j,l=1}^{n} [jl, k]\, \dot{x}_j\, \dot{x}_l = 0 \qquad (k = 1, 2, \dots, n). \tag{45'}$$

It is convenient to write these equations in the form obtained by solving for the $\ddot{x}$'s. To do this we introduce the quantities

$$p^i = \sum_{k=1}^{n} a^{ik}\, p_k \tag{46}$$

and replace the equations (45) by the equivalent system of linear combinations

$$p^i = 0,$$

or

$$\ddot{x}_i + \sum_{j,l=1}^{n} \{jl, i\}\, \dot{x}_j\, \dot{x}_l = 0 . \tag{47}$$


These $n$ differential equations of the second order in the $n$ unknown functions $x_i(s)$ are equivalent to equation (39) and are therefore the characteristic equations of a geodesic; when integrated, they give the parametric equations of the curve. By the ordinary theory of such equations, the integrals will contain $2n$ arbitrary constants, which can be determined by the condition that the geodesic passes through two specified points, or that it starts from a given point and has a specified direction.

It may be noted that the equations (47) contain only intrinsic 
elements, as the definition of a geodesic would lead us to expect. 
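As a hedged numerical illustration of the equations (47), the sketch below integrates them for the unit sphere, whose $ds^2 = d\theta^2 + \sin^2\theta\, d\varphi^2$ has as its only non-vanishing Christoffel's symbols of the second kind $\{\varphi\varphi, \theta\} = -\sin\theta\cos\theta$ and $\{\theta\varphi, \varphi\} = \cot\theta$. The choice of the sphere, the Runge-Kutta scheme, the step count and the function names are all assumptions of the example.

```python
import math

def geodesic_rhs(state):
    """First-order form of (47) on the unit sphere.

    state = (theta, phi, dtheta/ds, dphi/ds)."""
    theta, phi, dtheta, dphi = state
    ddtheta = math.sin(theta) * math.cos(theta) * dphi ** 2
    ddphi = -2.0 * (math.cos(theta) / math.sin(theta)) * dtheta * dphi
    return (dtheta, dphi, ddtheta, ddphi)

def rk4_step(state, h):
    """One classical fourth-order Runge-Kutta step of length h in s."""
    def shifted(k, factor):
        return tuple(s + factor * c for s, c in zip(state, k))
    k1 = geodesic_rhs(state)
    k2 = geodesic_rhs(shifted(k1, h / 2.0))
    k3 = geodesic_rhs(shifted(k2, h / 2.0))
    k4 = geodesic_rhs(shifted(k3, h))
    return tuple(s + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def integrate_geodesic(state, length, steps=2000):
    """Follow the geodesic with the given initial state over an arc `length`."""
    h = length / steps
    for _ in range(steps):
        state = rk4_step(state, h)
    return state

def speed_squared(state):
    """The quadratic form a_ik x'_i x'_k, which should remain equal to 1."""
    theta, _, dtheta, dphi = state
    return dtheta ** 2 + math.sin(theta) ** 2 * dphi ** 2

# Start on the equator in an oblique unit direction and follow an arc pi.
start = (math.pi / 2.0, 0.0, 0.6, 0.8)
end = integrate_geodesic(start, math.pi)
```

The initial point and the initial direction fix the four arbitrary constants; after an arc of length $\pi$ the curve should reach the diametrically opposite point ($\theta = \pi/2$, $\varphi = \pi$), as it must for a great circle, while $\sum a_{ik}\, \dot{x}_i \dot{x}_k$ remains equal to 1.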


25. Geodesic curvature. 


The discussion in the preceding section provides us with an opening for the introduction of an illuminating and fertile geometrical notion relating to any curve $x_i = x_i(s)$ in our $V_n$.

We must first show that the quantities $p_k$ defined by (43), corresponding to a generic curve $l$, are covariant (and in consequence the $p^i$'s are contravariant), so that we shall naturally associate with every point of the curve $l$ (which is a priori any curve whatever) the vector $\mathbf{p}$ of which they are the components. Suppose then that we pass by any transformation from the variables $x$ to new variables $\bar{x}$, and let $\bar{p}_k$ represent the values of the $p_k$'s calculated in the new system. We get from (44), through the invariance of $\delta l$,

$$\int_A^B \sum_{k=1}^{n} p_k\, \delta x_k\, ds = \int_A^B \sum_{k=1}^{n} \bar{p}_k\, \delta \bar{x}_k\, ds,$$

and therefore

$$\int_A^B \left[ \sum_{k=1}^{n} p_k\, \delta x_k - \sum_{k=1}^{n} \bar{p}_k\, \delta \bar{x}_k \right] ds = 0 .$$

By a similar argument to that used in passing from (39') to (45) we deduce from this that at every point of $l$ we must have

$$\sum_{k=1}^{n} p_k\, \delta x_k = \sum_{k=1}^{n} \bar{p}_k\, \delta \bar{x}_k,$$

which expresses the invariance of

$$\sum_{k=1}^{n} p_k\, \delta x_k$$

(a linear form in the arbitrary contravariant variables $\delta x_k$) and therefore the covariance of the $p_k$'s. It follows from (46) that the reciprocal contravariant system consists of the $p^i$'s, i.e. of the left-hand side of equations (47).

We shall use the term geodesic curvature of the curve $l$ at any point on $l$ to denote the vector $\mathbf{p}$ whose covariant components are defined by (43), its contravariant components being in consequence defined by (46), or by

$$p^i = \ddot{x}_i + \sum_{j,l=1}^{n} \{jl, i\}\, \dot{x}_j\, \dot{x}_l \qquad (i = 1, 2, \dots, n). \tag{43'}$$

An immediate corollary is that the geodesics are the lines whose geodesic curvature is everywhere zero.

More generally we have for the length of the vector $\mathbf{p}$ an important property, pointed out by Lipka,¹ which we shall merely state without proof: The absolute value of the geodesic curvature is represented, as in ordinary space, by the ratio between the angle of contingence and the elementary arc, where the angle of contingence is defined as the angle, at one extremity of the arc, between the tangent at that point and the parallel to the tangential direction at the other extremity.

Another important property is that the geodesic curvature is normal to the curve, which is equivalent to saying that

$$\sum_{k=1}^{n} p_k\, \dot{x}_k = 0, \tag{48}$$

since the parameters of the tangent to the curve are the $\dot{x}_k$'s. To prove this, take the identity

$$\sum_{j,k=1}^{n} a_{kj}\, \dot{x}_k\, \dot{x}_j = 1$$

(obtained by dividing (32) by $ds^2$) and differentiate it with respect to $s$. We get

$$2 \sum_{j,k=1}^{n} a_{kj}\, \dot{x}_k\, \ddot{x}_j + \sum_{j,k=1}^{n} \frac{da_{kj}}{ds}\, \dot{x}_k\, \dot{x}_j = 0,$$

or

$$2 \sum_{j,k=1}^{n} a_{kj}\, \dot{x}_k\, \ddot{x}_j + \sum_{j,k,l=1}^{n} \frac{\partial a_{kj}}{\partial x_l}\, \dot{x}_k\, \dot{x}_j\, \dot{x}_l = 0,$$

and therefore, by (24'),

$$2 \sum_{j,k=1}^{n} a_{kj}\, \dot{x}_k\, \ddot{x}_j + \sum_{j,k,l=1}^{n} [jl, k]\, \dot{x}_k\, \dot{x}_j\, \dot{x}_l + \sum_{j,k,l=1}^{n} [kl, j]\, \dot{x}_k\, \dot{x}_j\, \dot{x}_l = 0 .$$

Interchanging $k$ and $j$, we see that the third sum is the same as the second; hence, taking out the factor 2, we have

$$\sum_{k=1}^{n} \dot{x}_k \left\{ \sum_{j=1}^{n} a_{kj}\, \ddot{x}_j + \sum_{j,l=1}^{n} [jl, k]\, \dot{x}_j\, \dot{x}_l \right\} = 0 .$$

This is merely (48), with $p_k$ replaced by its value as given by (43); hence the assertion made above is proved.

¹ “Sulla curvatura geodetica delle linee appartenenti ad una varietà qualunque”, in Rend. della R. Acc. dei Lincei, Vol. XXXI (1st half-year, 1922), pp. 353-356.

In ordinary space, as will at once be seen, the geodesic curva- 
ture coincides in direction with the principal normal, and in 
magnitude with the flexion or principal curvature of the curve. 
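To illustrate (43') and (48) we may take, as an assumed example, a parallel of latitude $\theta = \theta_0$ on the unit sphere, described with unit velocity, so that $\dot{\theta} = 0$ and $\dot{\varphi} = 1/\sin\theta_0$. The function names below are illustrative only.

```python
import math

def geodesic_curvature(theta, dtheta, dphi, ddtheta, ddphi):
    """Contravariant components p^i of (43') on the unit sphere,
    where ds^2 = dtheta^2 + sin^2(theta) dphi^2."""
    p_theta = ddtheta - math.sin(theta) * math.cos(theta) * dphi ** 2
    p_phi = ddphi + 2.0 * (math.cos(theta) / math.sin(theta)) * dtheta * dphi
    return (p_theta, p_phi)

def latitude_circle(theta0):
    """Derivatives (dtheta, dphi, ddtheta, ddphi) with respect to the arc s
    along the parallel theta = theta0."""
    return (0.0, 1.0 / math.sin(theta0), 0.0, 0.0)

def length(theta, v):
    """Length of a vector with contravariant components v at colatitude theta."""
    return math.sqrt(v[0] ** 2 + math.sin(theta) ** 2 * v[1] ** 2)

def scalar_product(theta, u, v):
    """Sum of a_ik u^i v^k for the metric of the unit sphere."""
    return u[0] * v[0] + math.sin(theta) ** 2 * u[1] * v[1]

theta0 = math.pi / 3.0
dtheta, dphi, ddtheta, ddphi = latitude_circle(theta0)
p = geodesic_curvature(theta0, dtheta, dphi, ddtheta, ddphi)
tangent = (dtheta, dphi)
```

The vector $\mathbf{p}$ is found to have the length $|\cot\theta_0|$ and to be perpendicular to the tangent; on the equator ($\theta_0 = \pi/2$) it vanishes, as it should for a geodesic.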

26. Extension of the notion of parallelism. Bianchi’s derived 
vectors. 

We propose next to extend to a $V_n$ the notion of parallelism or, more generally, of equipollence defined above for a $V_2$.

In this case we have no criterion analogous to that used for the $V_2$; in general the circumscribed developable which formed the starting-point of the former argument does not exist.

The differential law of parallelism, however, expressed by the symbolic equation (19), can be immediately adapted to the case of a $V_n$. To do so, consider a vector $\mathbf{R}$ drawn from a point $P$ of $V_n$, and let $\mathbf{R} + d\mathbf{R}$ denote the equipollent vector drawn from a point $P_1$ of $V_n$, very near to $P$. We can think of the $V_n$, and therefore of the vectors $\mathbf{R}$, $\mathbf{R} + d\mathbf{R}$, as immersed in a Euclidean space $S_N$, where $N$ is a sufficiently large integer; we can therefore define the vectors $\mathbf{R}$, $\mathbf{R} + d\mathbf{R}$, not only by their (covariant or contravariant) components with respect to $V_n$, but also by their components $Y_\nu$, $Y_\nu + dY_\nu$ ($\nu = 1, 2, \dots, N$) with respect to a system of Cartesian co-ordinates $y_1, y_2, \dots, y_N$ in $S_N$. Now consider an arbitrary infinitesimal displacement $\delta P$, contained in $V_n$ and drawn from $P$; it can be specified either intrinsically, by means of the $\delta x_i$'s ($i = 1, 2, \dots, n$), or with reference to Cartesian co-ordinates by means of the $\delta y_\nu$'s ($\nu = 1, 2, \dots, N$); but it is to be noted that while the first set are arbitrary the second are not, on account of the equations (§ 21) which define the $y$'s as functions of the $x$'s. We can also say, in geometrical language, that the displacement must satisfy the condition of being tangential, i.e. of belonging to $V_n$. We shall define the vector $d\mathbf{R}$, and therefore the parallel displacement, by means of the symbolic equation (19), which can be expanded (p. 107) into the form, analogous to (19'):

$$\sum_{\nu=1}^{N} dY_\nu\, \delta y_\nu = 0, \tag{49}$$

which holds for all displacements satisfying the given condition.

The only difference between formula (49) and (19'), which defines parallelism with respect to a surface, is that the summation for $\nu$ is from 1 to $N$, instead of from 1 to 3. All successive steps in the calculation follow automatically as in § 15, p. 107.

We shall first write the equation in terms of the $\delta x$'s by putting

$$d\mathbf{R} \times \delta P = \sum_{\nu=1}^{N} dY_\nu\, \delta y_\nu = \sum_{k=1}^{n} \tau_k\, \delta x_k ; \tag{50}$$

after transformations analogous to those formerly used, we find for the $\tau$'s the expressions

$$\tau_k = \sum_{j=1}^{n} a_{kj}\, dR^j + \sum_{j,l=1}^{n} [jl, k]\, R^j\, dx_l \qquad (k = 1, 2, \dots, n), \tag{51}$$

an obvious generalization of formula (21''). Evidently, in view of the identity (50), we are here dealing with covariant expressions (with respect to any transformations of the $x$'s). The reciprocal system

$$\tau^i = \sum_{k=1}^{n} a^{ik}\, \tau_k$$

can also be expressed in the form

$$\tau^i = dR^i + \sum_{j,l=1}^{n} \{jl, i\}\, R^j\, dx_l \qquad (i = 1, 2, \dots, n), \tag{51'}$$

in complete analogy to (21''').

From (49) and (50) we finally reach the intrinsic equations of parallelism:

$$\tau_k = 0 \qquad (k = 1, 2, \dots, n);$$

these are equivalent to $\tau^i = 0$, or to

$$dR^i + \sum_{j,l=1}^{n} \{jl, i\}\, R^j\, dx_l = 0 \qquad (i = 1, 2, \dots, n), \tag{52}$$

which define the increments $dR^i$ of the contravariant components of a vector $\mathbf{R}$ for a displacement parallel with respect to $V_n$ from $P$ (of co-ordinates $x$) to $P_1$ (of co-ordinates $x + dx$).

For the covariant components we find, as on p. 113, the equivalent equations

$$dR_i = \sum_{j,l=1}^{n} \{il, j\}\, R_j\, dx_l \qquad (i = 1, 2, \dots, n). \tag{52'}$$

This equation and (52) alike show that parallel displacement is an intrinsic operation with respect to the metric of $V_n$. This was not a priori evident from the geometrical definition we adopted, which is expressed in formula (49), involving the use of a surrounding space $S_N$.

The equations (52) and (52') are, so to speak, identical with the equations (21^) and (27) which hold for a $V_2$, the only difference being in the number of dimensions. It is of course understood that $\{jl, i\}$ and $\{il, j\}$ denote Christoffel's symbols of the second kind constructed with the $ds^2$ of $V_n$.

All the properties deduced from the equations of parallelism
with respect to a surface can be extended without difficulty to
parallelism in $V_n$; in particular, the properties that parallel
displacement along any finite curve whatever is always possible, and
in only one way, and that parallel displacement leaves unchanged 
the scalar product of two vectors, and therefore lengths and 
angles. We shall show in the following section that we can also 
extend the property of autoparallelism of geodesics, which we 
proved geometrically in the case of surfaces. 
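The equations (52) can be integrated numerically along a given curve. The sketch below, which is only an illustration under assumed data, carries a vector by parallelism once round the parallel $\theta = \theta_0$ of the unit sphere, using $\{\varphi\varphi, \theta\} = -\sin\theta\cos\theta$ and $\{\theta\varphi, \varphi\} = \cot\theta$; the integration scheme and the function names are assumptions of the example.

```python
import math

def transport_rhs(theta0, R):
    """dR^i/dphi from (52) along the parallel theta = theta0 (dtheta = 0)."""
    r_theta, r_phi = R
    return (math.sin(theta0) * math.cos(theta0) * r_phi,
            -(math.cos(theta0) / math.sin(theta0)) * r_theta)

def transport_around_latitude(theta0, R, steps=2000):
    """Carry R by parallelism from phi = 0 to phi = 2 pi (Runge-Kutta)."""
    h = 2.0 * math.pi / steps
    for _ in range(steps):
        k1 = transport_rhs(theta0, R)
        k2 = transport_rhs(theta0, tuple(r + h / 2.0 * k for r, k in zip(R, k1)))
        k3 = transport_rhs(theta0, tuple(r + h / 2.0 * k for r, k in zip(R, k2)))
        k4 = transport_rhs(theta0, tuple(r + h * k for r, k in zip(R, k3)))
        R = tuple(r + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                  for r, a, b, c, d in zip(R, k1, k2, k3, k4))
    return R

def length(theta0, R):
    """Length of the vector with contravariant components R."""
    return math.sqrt(R[0] ** 2 + math.sin(theta0) ** 2 * R[1] ** 2)

# A unit vector along the meridian, carried round the parallel theta0 = pi/3.
end = transport_around_latitude(math.pi / 3.0, (1.0, 0.0))
```

The length of the vector is unchanged, in agreement with the property just stated; on this particular parallel the vector returns to its starting point reversed, since it turns through the angle $2\pi\cos\theta_0 = \pi$ relative to the co-ordinate directions.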

We may also call attention here to the notion due to Bianchi¹ of the derivative of a generic vector $\mathbf{R}$ along a curve $T$, $\mathbf{R}$ being a function of the points of $T$. If the vectors $\mathbf{R}(s)$ at various points of $T$ are not parallel along $T$, the contravariant simple system $\tau^i$, defined by (51'), is not identically zero. Accordingly the quantities

$$\frac{\tau^i}{ds} \qquad (i = 1, 2, \dots, n)$$

may be considered as contravariant components of a non-zero vector $D\mathbf{R}$ which is also a function of the points of $T$. The vector $D\mathbf{R}$ has been called by Bianchi associated, and its direction and length the direction and curvature associated point by point with the vector $\mathbf{R}(s)$. We prefer, however, the qualification derived, because in Euclidean spaces $D\mathbf{R}$ is precisely the vector commonly known as the derivative of $\mathbf{R}$ with respect to the arc $s$ of $T$. In fact, if we assume Cartesian co-ordinates, the Christoffel's symbols vanish, and the preceding expressions $\tau^i / ds$ for the (ordinary) components of $D\mathbf{R}$ reduce to $\dfrac{dR^i}{ds}$.

¹ Cf. “Sul parallelismo vincolato di Levi-Civita nella metrica degli spazi curvi”, in Rend. della R. Acc. di Napoli, Vol. XXVIII, 1922, pp. 160-171.
Returning to a general manifold $V_n$, if $\mathbf{R}(s)$ reduces to the versor which is tangential to the curve $T$, i.e. in particular if $R^i = \dfrac{dx_i}{ds}$, we find that we are again dealing with the vector $\mathbf{p}$ of geodesic curvature considered in the preceding section.

It can be shown that in every case $D\mathbf{R}$ (if not zero) is perpendicular to $\mathbf{R}$, and that it has other interesting properties demonstrated by Bianchi. For further details the reader is referred to the paper just cited in the footnote, or to the Appendix to Vol. II of the same writer's Lezioni di geometria differenziale.¹

¹ Second edition. Bologna, Zanichelli, 1923.


27. Autoparallelism of geodesics.


Analytically we may derive this property from the equations of parallelism by using the differential equations found above for geodesics.

Let $\boldsymbol{\lambda}$ denote a unit vector defined at all points of the geodesic under consideration and having everywhere the same direction as the geodesic. We shall show that $\boldsymbol{\lambda}$ may be considered as undergoing a parallel displacement along the geodesic.

Let its parameters be $\lambda^i$. From the definition of these parameters, and using the parametric equations $x_i = x_i(s)$ of the geodesic in question, we plainly have

$$\lambda^i = \frac{dx_i}{ds} = \dot{x}_i,$$

and therefore

$$\frac{d\lambda^i}{ds} = \ddot{x}_i .$$

Now the $\ddot{x}$'s and the $\dot{x}$'s are connected by the equations (47) (the characteristic equations of the geodesics). Substituting $\lambda^i$ and $\dfrac{d\lambda^i}{ds}$ for $\dot{x}_i$ and $\ddot{x}_i$, these become

$$p^i = \frac{d\lambda^i}{ds} + \sum_{j,l=1}^{n} \{jl, i\}\, \lambda^j \lambda^l = 0, \tag{53}$$

or multiplying by $ds$,

$$p^i\, ds = d\lambda^i + \sum_{j,l=1}^{n} \{jl, i\}\, \lambda^j\, dx_l = 0,$$

which are the equations expressing the parallel displacement of the vector $\boldsymbol{\lambda}$.

It is worth noting that a comparison of (51') and (53) shows that the quantities $p^i\, ds$ are a particular case of the $\tau^i$'s, the generic vector $\mathbf{R}$ being replaced by the unit vector $\boldsymbol{\lambda}$ of contravariant components $\dot{x}_i$. There follows immediately the contravariance of the quantities $p^i$ or, which comes to the same thing, the covariance of the quantities $p_k$, which we proved directly in § 25.


28. Remarks on the case of an indefinite $ds^2$.

We agreed (§ 20) to say that an $n$-dimensional $V_n$ is metrically defined when there is associated with it a differential quadratic form, with real coefficients $a_{ik}$,

$$\phi = \sum_{i,k=1}^{n} a_{ik}\, dx_i\, dx_k .$$

We then introduced the hypothesis that $\phi$ is definite and positive, and this is the only case we have considered in the foregoing sections. We now propose to make some remarks on the case in which $\phi$ is still supposed irreducible (or such that its discriminant $a$ is not zero), but is no longer definite, being capable of taking positive values for certain sets of differentials $dx_i$ and negative values for certain others.

In this case also, fixing a generic point $P$ of co-ordinates $x_i$ and an infinitely near point $P'$ of co-ordinates $x_i + dx_i$, we put

$$ds^2 = \phi = \sum_{i,k=1}^{n} a_{ik}\, dx_i\, dx_k, \tag{54}$$

and we shall call $ds^2$ (which can now be positive, negative, or zero) the square of the line element (the distance), or better the interval between the two points $P$ and $P'$.

Among the (real) systems of differentials dxi, or, as we shall 
say, considering only ratios, among the directions drawn 

from P, there are which satisfy the quadratic equation 

^ 0 (55) 

For a moment we shall interpret the differentials dx^ as 

referring to Cartesian co-ordinates with origin P. Then the 
directions just defined, which are said to be of zero interval, 
constitute a quadric cone of vertex P. This cone divides the 
sheaf of directions drawn from P into two regions, in one of 
which 

ds- >0, (56) 

and in the other 

< 0 (57) 

All the directions in the first region are said to be of the first
kind or timelike (the term being suggested by the interpretation
given to these symbols in the theory of relativity); those in the
second region are said to be of the second kind or spacelike. The
parameters of a direction of either kind are defined by the
formulae

$$\lambda^i = \frac{dx_i}{|ds|} \qquad (i = 1, 2, \dots n); \tag{58}$$

there is no analogous result for the directions of zero interval,
corresponding to which $ds^2 = 0$.
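
As an illustrative sketch only, the classification of directions and the parameters of formula (58) can be tried numerically. The function names, the tolerance, and the sample form with signature (+, −, −, −) are assumptions made for this example and are not taken from the text.

```python
def interval_squared(a, dx):
    """Return ds^2 = sum over i, k of a[i][k] * dx[i] * dx[k]."""
    n = len(dx)
    return sum(a[i][k] * dx[i] * dx[k] for i in range(n) for k in range(n))


def classify(a, dx, tol=1e-12):
    """Name the kind of direction: timelike, spacelike, or zero interval."""
    ds2 = interval_squared(a, dx)
    if ds2 > tol:
        return "timelike"
    if ds2 < -tol:
        return "spacelike"
    return "zero interval"


def parameters(a, dx, tol=1e-12):
    """Return lambda^i = dx_i / |ds|; undefined for zero interval."""
    ds2 = interval_squared(a, dx)
    if abs(ds2) <= tol:
        raise ValueError("no parameters exist for a direction of zero interval")
    norm = abs(ds2) ** 0.5
    return [d / norm for d in dx]


# Assumed sample form: ds^2 = dx_0^2 - dx_1^2 - dx_2^2 - dx_3^2.
MINKOWSKI = [
    [1, 0, 0, 0],
    [0, -1, 0, 0],
    [0, 0, -1, 0],
    [0, 0, 0, -1],
]
```

With this form, the parameters of a timelike direction give the value +1 when substituted into the quadratic form, and those of a spacelike direction give −1.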

For timelike directions $ds^2 > 0$; hence, if $ds$ denotes the
arithmetic value of the square root of $ds^2$, we have $|ds| = ds$,
and the argument is exactly as it was for the definite quadric.

For spacelike directions, on the contrary, we have

$$|ds^2| = -\sum_{i,k=1}^{n} a_{ik}\, dx_i\, dx_k,$$

so that the quadratic identity satisfied by the parameters $\lambda^i$
is

$$\sum_{i,k=1}^{n} a_{ik}\, \lambda^i \lambda^k = -1, \tag{59}$$

with $-1$ on the right, instead of $+1$ for the timelike directions.

Granted these results, the systematic extension of the preceding
sections to the indefinite case would certainly not seem
likely to be difficult. As we are not aware that this has yet been
done exhaustively, the reader's attention may be called to it.
We propose merely to point out an essential fact, almost evident
a priori and often used in the theory of relativity; namely, that
the definitions, geometrical representations, and formulas in the
foregoing sections can certainly be carried over and applied to
the indefinite case, provided (a) that we take account of the
exceptional behaviour of the directions of zero interval, and
(b) that we make the obvious formal modifications necessitated
by (59) when we have to deal with spacelike directions.

We leave the matter here,^ with two examples to conclude
the discussion:

(1) The condition that two directions, whether timelike or
spacelike, of parameters $\lambda^i$, $\mu^i$ may be orthogonal is in every
case expressed by the equation

$$\sum_{i,k=1}^{n} a_{ik}\, \lambda^i \mu^k = 0.$$

(2) If we consider only lines wholly composed of timelike
elements ($ds^2 > 0$), the discussion in § 24 holds without
modification, and we reach the same equations (47) of the geodesics.


^ See Chapter XI, p. 287.



PART II 


The Fundamental Quadratic Form and 
the Absolute Differential Calculus 


CHAPTER VI 

Covariant Differentiation; Invariants and Differential
Parameters; Locally Geodesic Co-ordinates


1. Covariant differentiation.

Returning to the remarks made on p. 86 of Chapter IV,
we now propose to generalize the operation of differentiation by
substituting for the ordinary derivatives of the elements of a
tensor certain linear combinations of these derivatives and of
the elements of the given system, which will in their turn
constitute a mixed (or in particular, covariant) system with one
index of covariance more than the given system. Explicitly, if
$A^{h \dots}_{i \dots}$ is the given generic system whose elements are functions
of the $x$'s or, in geometrical terms, functions of position, we shall
deduce from it another system $A^{h \dots}_{i \dots | l}$, where $l$ is a new index of
covariance, which reduces to the system $\partial A^{h \dots}_{i \dots} / \partial x_l$ in the particular
case when the co-ordinates are Cartesian.

To simplify the formulae, we shall consider first a mixed system
$A_i^h$ with a single index $i$ of covariance and a single index $h$ of
contravariance.

Fixing our attention on a specific point of $V_n$ (i.e. ignoring
the fact that the $A$'s are defined as functions of position), we
know that the law of transformation of the functions $A_i^h$ for
a change of variables is defined by the invariance of the form

$$F = \sum_{i,h=1}^{n} A_i^h\, \xi^i u_h, \tag{1}$$

in which the $\xi^i$'s constitute a generic contravariant system, or,
in other words, are the contravariant components of a generic
vector $\xi$; similarly the $u_h$'s can be considered as the covariant
components of a generic vector $u$.

Now, since a set of values of the $A_i^h$ is associated with
every point of $V_n$, we can at every point choose two arbitrary
vectors $\xi$, $u$, and construct an invariant form with them and
the $A$'s.

Suppose this choice made at an arbitrary but determined
point $P$, and consider also a generic point $P_1$ infinitely near to
$P$. We shall agree to take for $\xi$ and $u$ at $P_1$ the vectors parallel
to those chosen at $P$; as the displacement is infinitesimal, the
curve of displacement is immaterial. We shall use the operator
$\delta$ to denote in general the increment of a quantity in passing
from $P$ to $P_1$, and we propose to calculate $\delta F$. Differentiating
(1) with the operator $\delta$, we have

$$\delta F = \sum_{i,h=1}^{n} \left[ \delta A_i^h\, \xi^i u_h + A_i^h\, \delta\xi^i\, u_h + A_i^h\, \xi^i\, \delta u_h \right].$$

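Now, by the convention just adopted as to the vectors $\xi$
and $u$, the differentials $\delta\xi^i$ and $\delta u_h$ must be calculated by the
formulae of parallelism ((52) and (52'), pp. 188, 189), while
$\delta A_i^h$ is given by the usual rule of differentiation

$$\delta A_i^h = \sum_{l=1}^{n} \frac{\partial A_i^h}{\partial x_l}\, \delta x_l,$$

the $A$'s being by hypothesis functions of position. Using these
results, we have

$$\delta F = \sum_{i,h,l} \frac{\partial A_i^h}{\partial x_l}\, \xi^i u_h\, \delta x_l - \sum_{i,h,j,l} A_i^h \{jl, i\}\, \xi^j u_h\, \delta x_l + \sum_{i,h,j,l} A_i^h \{hl, j\}\, \xi^i u_j\, \delta x_l.$$

Interchanging $i$ and $j$ in the second sum and $h$ and $j$ in the
third, so as to get the factor $\xi^i u_h\, \delta x_l$ in all three sums, and
collecting all the terms under a single summation sign, we have

$$\delta F = \sum_{i,h,l} \left[ \frac{\partial A_i^h}{\partial x_l} - \sum_{j} \{il, j\} A_j^h + \sum_{j} \{jl, h\} A_i^j \right] \xi^i u_h\, \delta x_l. \tag{2}$$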
Now the left-hand side of this equation is invariant on account
of its meaning, while $\xi^i$, $u_h$, $\delta x_l$ are arbitrary contravariant or
covariant systems; hence the coefficients of this form (the
expression in square brackets) constitute by definition a system which
is covariant with respect to $i$ and $l$ and contravariant with respect
to $h$. We can therefore put

$$A^h_{i|l} = \frac{\partial A_i^h}{\partial x_l} - \sum_{j=1}^{n} \{il, j\} A_j^h + \sum_{j=1}^{n} \{jl, h\} A_i^j. \tag{3}$$

This system is called the covariant derivative of the system
$A_i^h$. It is sometimes denoted by the symbol and also, when
no ambiguity is possible, simply by

It is obvious that in Cartesian co-ordinates (which exist when 
we are dealing with Euclidean forms; cf. § 21 of Chapter V, 
p. 121) the system reduces to that of ordinary derivatives. 

The method used above can be applied, mutatis mutandis,
to a generic mixed system. We shall always get for $\delta F$ (as follows
at once on carrying out the necessary operations) a multilinear
form whose coefficients we shall define as elements of the
covariant derived system. These coefficients consist of a first
term which is the ordinary derivative, followed by as many
terms preceded by the minus sign as there are indices of
covariance of the given system, and as many terms preceded by
the plus sign as there are indices of contravariance. If we denote
by $(i)$ the aggregate of indices $i_1 \dots i_m$, and by $(h)$ the aggregate
$h_1 \dots h_\mu$, the general formula is^

$$A^{(h)}_{(i)|l} = \frac{\partial A^{(h)}_{(i)}}{\partial x_l} - \sum_{\nu=1}^{m} \sum_{j=1}^{n} \{i_\nu l, j\}\, A^{(h)}_{i_1 \dots i_{\nu-1}\, j\, i_{\nu+1} \dots i_m} + \sum_{\nu=1}^{\mu} \sum_{j=1}^{n} \{jl, h_\nu\}\, A^{h_1 \dots h_{\nu-1}\, j\, h_{\nu+1} \dots h_\mu}_{(i)}. \tag{4}$$


^ Cf. A. Palatini: "Sui fondamenti del Calcolo Differenziale assoluto", in
Rend. del Circolo Mat. di Palermo, Vol. XLIII, 1919, pp. 192-202. Another
vectorial illustration of covariant differentiation was given by the late Prof.
Hessenberg in his paper "Vektorielle Begründung der Differentialgeometrie", in
Math. Ann., Vol. 78, 1917, pp. 187-217.
2. Particular cases.

Consider first a covariant simple system $A_i$, which we can
always interpret as consisting of the covariant components
(moments) of a vector $A$. In this case the terms contributed by
the indices of contravariance are absent, and (4) (or (3)) gives

$$A_{i|l} = \frac{\partial A_i}{\partial x_l} - \sum_{j=1}^{n} \{il, j\} A_j. \tag{5}$$


It is easy to see that this double system is not in general
symmetrical; from (5) however we get at once the important
relation

$$A_{i|l} - A_{l|i} = \frac{\partial A_i}{\partial x_l} - \frac{\partial A_l}{\partial x_i}. \tag{6}$$


The vanishing of the covariant derivative $A_{i|l}$ has a simple
geometrical significance. In this case, multiplying (5) by $dx_l$ we
have

$$\frac{\partial A_i}{\partial x_l}\, dx_l = \sum_{j=1}^{n} \{il, j\} A_j\, dx_l;$$

comparing this with equation (52') of the preceding chapter, in
which we suppose all the $dx$'s to vanish except the $l$th, we see
that it expresses the fact that the vector $A$ undergoes a parallel
displacement along the line $l$.
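
A small numerical sketch may make this concrete. It assumes the Euclidean plane in polar co-ordinates $(r, \theta)$, for which the only non-zero Christoffel symbols of the second kind are $\{12, 2\} = \{21, 2\} = 1/r$ and $\{22, 1\} = -r$, and it takes the unit vector along the Cartesian x-axis, whose covariant components are $A_1 = \cos\theta$, $A_2 = -r\sin\theta$. The function names and the finite-difference step are illustrative assumptions.

```python
import math


def christoffel(r):
    """Non-zero symbols {il, j} of ds^2 = dr^2 + r^2 dtheta^2, keyed (i, l, j)."""
    return {(0, 1, 1): 1.0 / r, (1, 0, 1): 1.0 / r, (1, 1, 0): -r}


def field(x):
    """Covariant components A_i of the unit vector along the Cartesian x-axis."""
    r, theta = x
    return [math.cos(theta), -r * math.sin(theta)]


def covariant_derivative(x, h=1e-5):
    """Return d with d[i][l] = A_{i|l} as in formula (5), by central differences."""
    n = len(x)
    gamma = christoffel(x[0])
    a = field(x)
    d = [[0.0] * n for _ in range(n)]
    for l in range(n):
        x_plus, x_minus = list(x), list(x)
        x_plus[l] += h
        x_minus[l] -= h
        a_plus, a_minus = field(x_plus), field(x_minus)
        for i in range(n):
            ordinary = (a_plus[i] - a_minus[i]) / (2 * h)
            correction = sum(gamma.get((i, l, j), 0.0) * a[j] for j in range(n))
            d[i][l] = ordinary - correction
    return d
```

Since this field is constant in Cartesian co-ordinates, every $A_{i|l}$ should come out as zero up to discretisation error, although the ordinary derivatives of $A_1$ and $A_2$ in polar co-ordinates do not vanish.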

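Analogously, for the derivatives of a contravariant simple
system $A^i$, we have

$$A^i_{|l} = \frac{\partial A^i}{\partial x_l} + \sum_{j=1}^{n} \{jl, i\} A^j. \tag{5'}$$

Next, consider a system of order zero, i.e. an invariant $f$. In
this case (4) becomes

$$f_l = \frac{\partial f}{\partial x_l}, \tag{7}$$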


or the covariant and the ordinary derivatives are identical. If we
construct the system of covariant second derivatives, applying
formulae (5) to (7), we shall have

$$f_{lk} = \frac{\partial^2 f}{\partial x_l\, \partial x_k} - \sum_{j=1}^{n} \{lk, j\} \frac{\partial f}{\partial x_j}; \tag{8}$$

these are not the same as the ordinary second derivatives but,
like them, are symmetrical.

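For a covariant double tensor (4) becomes

$$A_{ik|l} = \frac{\partial A_{ik}}{\partial x_l} - \sum_{j=1}^{n} \{il, j\} A_{jk} - \sum_{j=1}^{n} \{kl, j\} A_{ij}, \tag{9}$$

and for a contravariant double tensor it becomes

$$A^{ik}_{|l} = \frac{\partial A^{ik}}{\partial x_l} + \sum_{j=1}^{n} \{jl, i\} A^{jk} + \sum_{j=1}^{n} \{jl, k\} A^{ij}. \tag{9'}$$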


3. Ricci's lemma.

If formula (9) is applied to the system of the coefficients $a_{ik}$
of $ds^2$, we get, remembering the expression for the derivatives
of these coefficients in terms of Christoffel's symbols (Chap. V,
§ 16, p. 111),

$$a_{ik|l} = 0 \qquad (i, k, l = 1, 2, \dots n). \tag{10}$$

This important theorem, that the covariant derivatives of the
coefficients $a_{ik}$ are zero, can be proved directly from the definition
of covariant differentiation. To do so, we must choose two
arbitrary vectors $\xi$, $\eta$, and construct the expression

$$F = \sum_{i,k=1}^{n} a_{ik}\, \xi^i \eta^k;$$

we then calculate $\delta F$ corresponding to a parallel displacement
of the vectors $\xi$, $\eta$, and we shall get a trilinear form in $\xi^i$, $\eta^k$, $\delta x_l$,
whose coefficients, by definition, will give the required derived
system.

Now $F$ is merely the scalar product of the vectors $\xi$ and $\eta$,
which, as we know, is not changed by a parallel displacement;
hence we shall have $\delta F = 0$ for any values whatever of the $\xi$'s, $\eta$'s,
and $\delta x$'s, which means that all the coefficients of this form vanish
identically.

Similarly we can show that the covariant derivatives of the
reciprocals $a^{ik}$ vanish; in this case we have to use the expression

$$F = \sum_{i,k=1}^{n} a^{ik}\, u_i v_k,$$

which is again the scalar product of the (arbitrary) vectors
$u$ and $v$.
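
Ricci's lemma can also be checked numerically. The sketch below is only an illustration under stated assumptions: the sample form is that of the Euclidean plane in polar co-ordinates, derivatives are estimated by central differences, and all function names are hypothetical. It builds the Christoffel symbols of the second kind from the derivatives of the $a_{ik}$ and then evaluates formula (9) for the coefficients themselves.

```python
def metric(x):
    """Assumed sample form: ds^2 = dr^2 + r^2 dtheta^2, with x = (r, theta)."""
    r, _theta = x
    return [[1.0, 0.0], [0.0, r * r]]


def inverse2(a):
    """Inverse of a 2 x 2 matrix, giving the reciprocal coefficients a^{ik}."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]


def d_metric(x, h=1e-5):
    """Return da with da[l][i][k] approximating d a_ik / d x_l."""
    n = len(x)
    out = []
    for l in range(n):
        x_plus, x_minus = list(x), list(x)
        x_plus[l] += h
        x_minus[l] -= h
        a_plus, a_minus = metric(x_plus), metric(x_minus)
        out.append([[(a_plus[i][k] - a_minus[i][k]) / (2 * h)
                     for k in range(n)] for i in range(n)])
    return out


def christoffel2(x):
    """Return g with g[i][l][j] = {il, j}, the symbols of the second kind."""
    n = len(x)
    da = d_metric(x)
    a_inv = inverse2(metric(x))
    # First kind: [il, k] = (d_l a_ik + d_i a_lk - d_k a_il) / 2.
    first = [[[0.5 * (da[l][i][k] + da[i][l][k] - da[k][i][l])
               for k in range(n)] for l in range(n)] for i in range(n)]
    return [[[sum(a_inv[j][k] * first[i][l][k] for k in range(n))
              for j in range(n)] for l in range(n)] for i in range(n)]


def covariant_derivative_of_metric(x):
    """Return c with c[i][k][l] = a_{ik|l}, computed from formula (9)."""
    n = len(x)
    a, da, g = metric(x), d_metric(x), christoffel2(x)
    return [[[da[l][i][k]
              - sum(g[i][l][j] * a[j][k] for j in range(n))
              - sum(g[k][l][j] * a[i][j] for j in range(n))
              for l in range(n)] for k in range(n)] for i in range(n)]
```

Every entry of the result should vanish to rounding error, which is the content of (10); the same check can be repeated with any other symmetric, non-singular `metric`.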


4. Contravariant differentiation. 


There is in the absolute differential calculus a kind of law
of reciprocity or duality in accordance with which we can deduce
from every theorem or formula a reciprocal theorem or formula,
by interchanging the words covariant and contravariant, and
lowering or raising the indices. We have already had several
examples of this; we shall now make some brief remarks on the
operation of contravariant differentiation, which corresponds to
that of covariant differentiation just described.

The shortest way to deduce from a system $A^{(h)}_{(i)}$ the system
$A^{(h)|l}_{(i)}$ which has the properties reciprocal to those of the
covariant derivatives, is to find the covariant derivative of the
given system and then compound it with the system of the
$a^{lk}$, i.e. to make

$$A^{(h)|l}_{(i)} = \sum_{k=1}^{n} a^{lk} A^{(h)}_{(i)|k}.$$

We could find for this system an expression analogous to
(4) and properties corresponding exactly to those of the covariant
derivatives; or we could find these properties directly from those
of the covariant derivatives, by using the foregoing formula of
definition. We shall therefore not pursue the argument in detail,
and shall instead resume our discussion of the fundamental
properties of covariant differentiation.


5. Conservation of the rules of the ordinary differential 
calculus. 


First, consider a tensor, in general mixed, which is the sum
of two others of the same rank and species, i.e.

$$A^{(h)}_{(i)} = B^{(h)}_{(i)} + C^{(h)}_{(i)}.$$

It will at once be seen that the covariant derivative of the
system $A$ is obtained, like an ordinary derivative, by adding
together that of $B$ and that of $C$, or

$$A^{(h)}_{(i)|l} = B^{(h)}_{(i)|l} + C^{(h)}_{(i)|l}. \tag{11}$$

This formula follows either from the linearity of (4), or from
the consideration that the form $F$ relative to $A$ is the sum of a
form relative to $B$ and a form relative to $C$, so that a similar
result holds for $\delta F$; the coefficients of the latter expression
(which are by definition the derivatives $A^{(h)}_{(i)|l}$) will therefore be
the sums of the corresponding coefficients of the other two
(which are by definition the derivatives $B^{(h)}_{(i)|l}$ and $C^{(h)}_{(i)|l}$). The
reasoning can be extended without difficulty to a sum of any
number of terms.

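Next, consider the derivative of a product. If $B^{(h')}_{(i')}$, $C^{(h'')}_{(i'')}$ are
two generic tensors, we shall denote their product by

$$A^{(h)}_{(i)} = B^{(h')}_{(i')}\, C^{(h'')}_{(i'')},$$

where the symbol $(i)$ stands for the aggregate of the indices
$(i')$ and $(i'')$ together, and similarly for $(h)$. We shall show that

$$A^{(h)}_{(i)|l} = B^{(h')}_{(i')|l}\, C^{(h'')}_{(i'')} + B^{(h')}_{(i')}\, C^{(h'')}_{(i'')|l}. \tag{12}$$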


To simplify the formulae we shall suppose that the systems
$B$ and $C$ have each only one index of covariance and one of
contravariance. We know (Chapter IV, § 8, p. 78) that if

$$\varphi = \sum_{i,h} B_i^h\, \xi^i u_h, \qquad \psi = \sum_{j,k} C_j^k\, \eta^j v_k$$

are the invariant forms for the systems $B$ and $C$, that for the
system $A$ is

$$F = \varphi\, \psi.$$

We shall therefore have

$$\delta F = \psi\, \delta\varphi + \varphi\, \delta\psi,$$

and equating the coefficients of $\xi^i \eta^j u_h v_k\, \delta x_l$ on both sides
of this equation we get equation (12) (for the particular case
considered).

Now consider the derivative of a compounded mixed system
(Chapter IV, p. 79)

$$A^{(h)}_{(i)} = \sum_{(r)(s)} B^{(h)(r)}_{(s)}\, C^{(s)}_{(i)(r)}, \tag{13}$$

where $(i)$ and $(h)$ have the meanings already explained, and
$(r)$ and $(s)$ denote the aggregate of all the indices affected by the
process of contraction. We shall show that

$$A^{(h)}_{(i)|l} = \sum_{(r)(s)} \left[ B^{(h)(r)}_{(s)|l}\, C^{(s)}_{(i)(r)} + B^{(h)(r)}_{(s)}\, C^{(s)}_{(i)(r)|l} \right]. \tag{14}$$

In particular, if each aggregate reduces to a single index
and if the process of contraction is applied only to one index,
(13) becomes

$$A_i^h = \sum_{r=1}^{n} B_r^h\, C_i^r \tag{13'}$$

and (14) becomes

$$A^h_{i|l} = \sum_{r=1}^{n} \left[ B^h_{r|l}\, C_i^r + B_r^h\, C^r_{i|l} \right]. \tag{14'}$$

We shall give the proof for this simpler case, merely pointing
out that it can be immediately extended to the general
case.

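We start from the invariant forms relative to the systems
$B$ and $C$

$$\varphi_\alpha = \sum_{h,r} B_r^h\, u_h\, \xi_\alpha^r, \qquad \psi_\alpha = \sum_{i,r} C_i^r\, \eta^i\, \xi_{\alpha|r},$$

where we have followed the same procedure as in Chapter IV,
p. 78, and introduced a set of $n$ contravariant systems $\xi_\alpha^r$
($\alpha = 1, 2, \dots n$) and the associated reciprocal set $\xi_{\alpha|r}$. The
invariant form

$$F = \sum_{\alpha=1}^{n} \varphi_\alpha\, \psi_\alpha$$

has the $A$'s as coefficients, as we saw in Chapter IV.

Applying the symbol of operation $\delta$ to this we get

$$\delta F = \sum_{\alpha=1}^{n} \left[ \delta\varphi_\alpha\, \psi_\alpha + \varphi_\alpha\, \delta\psi_\alpha \right],$$

and equating the coefficients of $\eta^i u_h\, \delta x_l$ on both sides of
this equation, we get (14').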
To sum up, we have shown that the fundamental rules of
ordinary differentiation hold good for covariant differentiation.

6. Applications. 

We note first of all that if we start from a generic simple
system (a function of position), say a covariant system $V_i$,
and consider its reciprocal $V^i$, we have by definition

$$V^i = \sum_{k=1}^{n} a^{ik} V_k;$$

hence, taking the covariant derivative and using Ricci's lemma,

$$V^i_{|l} = \sum_{k=1}^{n} a^{ik} V_{k|l}. \tag{15}$$

We shall next calculate the covariant derivative of the scalar
product $X$ of two vectors, which, as we know already, is identical
with the ordinary derivative.

Let $U$, $V$, be two generic vectors, and put

$$X = U \times V = \sum_{i=1}^{n} U_i V^i.$$

Taking the covariant derivative, we have

$$X_l = \sum_{i=1}^{n} \left[ U_{i|l}\, V^i + U_i\, V^i_{|l} \right].$$

In the second term on the right we can replace $V^i_{|l}$ by the
expression for it in (15), so that

$$\sum_{i} U_i V^i_{|l} = \sum_{i,k} a^{ik}\, U_i V_{k|l} = \sum_{k} U^k V_{k|l}.$$

Changing $k$ into $i$, and substituting in $X_l$, we get the formula

$$X_l = \sum_{i=1}^{n} \left[ U_{i|l}\, V^i + U^i\, V_{i|l} \right], \tag{16}$$

which is often used.

In particular, if $V = U$, we have $X = U^2$, and therefore

$$X_l = 2U \frac{\partial U}{\partial x_l} = 2 \sum_{i=1}^{n} U^i U_{i|l}. \tag{16'}$$
7. Divergence of a vector and of a double tensor. $\Delta_2$ of
an invariant.

Take a covariant simple system $X_i$, which we can always
think of as the aggregate of the components of a vector $X$, and
construct the invariant

$$\Theta = \sum_{i,k=1}^{n} a^{ik} X_{i|k}, \tag{17}$$

where the terms $X_{i|k}$ denote covariant derivatives. In the
particular case of the fundamental form being Euclidean, we
have $a^{ik} = \delta_i^k$, and also the covariant and ordinary derivatives
are identical; hence in this case (17) becomes

$$\Theta = \sum_{i=1}^{n} \frac{\partial X_i}{\partial x_i}.$$

In three dimensions this expression is called the divergence
of the vector $X$. We shall extend the use of this term to the
general case (17).

We can transform (17) by means of (15). Writing $X$ instead
of $V$, (15) becomes, for $l = i$,

$$X^i_{|i} = \sum_{k=1}^{n} a^{ik} X_{k|i}.$$

Summing with respect to $i$, the right-hand side gives $\Theta$,
as can be seen at once from (17) by putting $l$ instead of $k$ and
then interchanging $l$ and $i$. Hence we have

$$\Theta = \sum_{i=1}^{n} X^i_{|i}. \tag{17'}$$

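From the general rule for covariant differentiation, or more
specifically from (5'), we have, for $l = i$,

$$X^i_{|i} = \frac{\partial X^i}{\partial x_i} + \sum_{j=1}^{n} \{ji, i\} X^j.$$

Now sum with respect to $i$. Substituting from (17') on the
left, and from the identity

$$\sum_{i=1}^{n} \{ji, i\} = \frac{1}{\sqrt{a}} \frac{\partial \sqrt{a}}{\partial x_j}$$

(cf. formula (26), Chapter V, p. 112) on the right, and writing
$l$ as the index of summation on the right instead of $i$ and $j$, we
get

$$\Theta = \sum_{l=1}^{n} \left( \frac{\partial X^l}{\partial x_l} + \frac{X^l}{\sqrt{a}} \frac{\partial \sqrt{a}}{\partial x_l} \right),$$

or, taking the factor $1/\sqrt{a}$ outside the summation sign,

$$\Theta = \frac{1}{\sqrt{a}} \sum_{l=1}^{n} \frac{\partial \left( \sqrt{a}\, X^l \right)}{\partial x_l}. \tag{17''}$$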

This expression for the divergence is completely equivalent 
to the formulae (17) and (17'); it is more useful for purposes of 
calculation, while (17) and (17') on the contrary are more suited 
to theoretical discussions. 

In particular, consider the case where the vector in question
is the gradient of an invariant $u$, i.e. where

$$X_i = u_i = \frac{\partial u}{\partial x_i} \qquad (i = 1, 2, \dots n).$$

In this case the divergence is denoted by the symbol $\Delta_2 u$
and is called the second differential parameter of the function $u$;
the expression for it can be deduced at once from (17) or from
(17''), using in the calculations the fact that

$$u^i = \sum_{k=1}^{n} a^{ik} u_k.$$

We thus get

$$\Delta_2 u = \sum_{i,k=1}^{n} a^{ik} u_{ik} = \frac{1}{\sqrt{a}} \sum_{i=1}^{n} \frac{\partial}{\partial x_i} \left( \sqrt{a}\, u^i \right), \tag{18}$$

both these expressions being generalizations of the ordinary
expression for $\Delta_2$ in Cartesian co-ordinates.
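
The second form of (18) lends itself to direct computation. The following is a minimal sketch, assuming the Euclidean plane in polar co-ordinates as the sample form and using nested central differences; the names `delta2`, `partial`, and the step `h` are hypothetical choices made for the illustration.

```python
import math


def metric(x):
    """Assumed sample form: ds^2 = dr^2 + r^2 dtheta^2, with x = (r, theta)."""
    r, _theta = x
    return [[1.0, 0.0], [0.0, r * r]]


def sqrt_a_and_reciprocal(x):
    """Return sqrt(a) and the reciprocal coefficients a^{ik} (2 x 2 case only)."""
    a = metric(x)
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    inv = [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]
    return math.sqrt(det), inv


def partial(f, x, i, h):
    """Central-difference estimate of the derivative of f with respect to x_i."""
    x_plus, x_minus = list(x), list(x)
    x_plus[i] += h
    x_minus[i] -= h
    return (f(x_plus) - f(x_minus)) / (2 * h)


def delta2(u, x, h=1e-4):
    """Second differential parameter of u from the right-hand form of (18)."""
    n = len(x)

    def flux(i):
        # sqrt(a) * u^i, where u^i = sum over k of a^{ik} * du/dx_k.
        def value(y):
            root, inv = sqrt_a_and_reciprocal(y)
            return root * sum(inv[i][k] * partial(u, y, k, h) for k in range(n))
        return value

    root, _ = sqrt_a_and_reciprocal(x)
    return sum(partial(flux(i), x, i, h) for i in range(n)) / root
```

For $u = r^2$ (that is, $x^2 + y^2$) the Cartesian value of $\Delta_2 u$ is 4, and for $u = r^2 \cos 2\theta$ (that is, $x^2 - y^2$) it is 0; calling `delta2` on these two functions at any point with $r > 0$ should reproduce those values to within the discretisation error.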

Next, take a contravariant double tensor $X^{ik}$. We note
first of all that if instead the given tensor were covariant ($X_{ik}$)
or mixed ($X_i^k$), we could always compound it with the $a^{ik}$'s and
so obtain an associated tensor in which both indices are indices
of contravariance; so that the choice of a contravariant tensor
does not really constitute a restriction. From this tensor, taking
the covariant derivative and applying the process of contraction,
we get the contravariant simple system

$$Y^i = \sum_{k=1}^{n} X^{ik}_{|k}, \tag{19}$$

which, by an obvious analogy with the former case, is called the
divergence of the given double tensor. If the process of contraction
were applied to the first instead of to the second index, we
should plainly get a contravariant system

$$\sum_{k=1}^{n} X^{ki}_{|k};$$

in general this is distinct from the divergence $Y^i$, coinciding with
it only in the particular case when the given tensor $X^{ik}$ is
symmetrical. Vice versa, if $X_{ik}$ is the system reciprocal to $X^{ik}$ (the
indices corresponding in the order written), we see at once from
the rules in § 5 that the system

$$Y_i = \sum_{k,l=1}^{n} a^{kl} X_{ik|l}$$

is merely the covariant system reciprocal to (19). Returning
to (19), it should be added that the expression on the right
cannot in general be transformed (as was done for the ordinary
divergence (17)) into an expression which is convenient for
actual calculations. In the case of an antisymmetrical tensor
($X^{ik} + X^{ki} = 0$), however, the analogy in this respect is
perfect. In fact, if we substitute in (19) the values of $X^{ik}_{|k}$ given
by (9'), the second term on the right vanishes from the
antisymmetry of the $X$'s, while the other two give

$$Y^i = \sum_{k=1}^{n} \frac{\partial X^{ik}}{\partial x_k} + \sum_{j,k=1}^{n} \{jk, k\} X^{ij}.$$

From this expression, by the same method as that just used
to pass from (17') to (17''), we get the equation

$$Y^i = \frac{1}{\sqrt{a}} \sum_{k=1}^{n} \frac{\partial \left( \sqrt{a}\, X^{ik} \right)}{\partial x_k}.$$
8. Some laws of transformation. ε-systems. Vector product.
Extension of a field.

Consider a set of $n$ covariant simple systems $\lambda_{\alpha|i}$ (where
$\alpha$ is the ordinal number of the system and $i$ the index of
covariance) and the determinant of the set

$$\nabla = \| \lambda_{\alpha|i} \|.$$

Changing the co-ordinates from the $x$'s to another set of
variables $\bar{x}$, the systems are transformed (in accordance
with the law of covariance) into another set of systems $\bar{\lambda}_{\alpha|i}$.
Construct the determinant of these new quantities

$$\bar{\nabla} = \| \bar{\lambda}_{\alpha|i} \|.$$

We shall show that the relation between $\bar{\nabla}$ and $\nabla$ is

$$\bar{\nabla} = \nabla D, \tag{20}$$

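where $D$ denotes the Jacobian determinant of the transformation,
i.e.

$$D = \frac{\partial (x_1, x_2, \dots x_n)}{\partial (\bar{x}_1, \bar{x}_2, \dots \bar{x}_n)},$$

which is of course not zero, it being always supposed that we are
using a reversible transformation (§ 2, p. 3). The relation (20)
can be verified at once if we construct the product by rows of
the two determinants on the right, viz.

$$\begin{vmatrix} \lambda_{1|1} & \lambda_{1|2} & \dots & \lambda_{1|n} \\ \lambda_{2|1} & \lambda_{2|2} & \dots & \lambda_{2|n} \\ \vdots & \vdots & & \vdots \\ \lambda_{n|1} & \lambda_{n|2} & \dots & \lambda_{n|n} \end{vmatrix} \cdot \begin{vmatrix} \dfrac{\partial x_1}{\partial \bar{x}_1} & \dfrac{\partial x_2}{\partial \bar{x}_1} & \dots & \dfrac{\partial x_n}{\partial \bar{x}_1} \\ \dfrac{\partial x_1}{\partial \bar{x}_2} & \dfrac{\partial x_2}{\partial \bar{x}_2} & \dots & \dfrac{\partial x_n}{\partial \bar{x}_2} \\ \vdots & \vdots & & \vdots \\ \dfrac{\partial x_1}{\partial \bar{x}_n} & \dfrac{\partial x_2}{\partial \bar{x}_n} & \dots & \dfrac{\partial x_n}{\partial \bar{x}_n} \end{vmatrix};$$

we see at once that the elements of the product determinant are
precisely the quantities

$$\bar{\lambda}_{\alpha|i} = \sum_{k=1}^{n} \lambda_{\alpha|k} \frac{\partial x_k}{\partial \bar{x}_i}. \tag{21}$$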
It is also useful to examine the behaviour of the discriminant

$$a = \| a_{ik} \|$$

of the fundamental form when we change from the variables
$x$ to the variables $\bar{x}$. For this purpose, we take the
transformation law of the coefficients $a_{ik}$ (§ 12, p. 85),

$$\bar{a}_{ik} = \sum_{j,h=1}^{n} a_{jh} \frac{\partial x_j}{\partial \bar{x}_i} \frac{\partial x_h}{\partial \bar{x}_k};$$

putting

$$b_{hi} = \sum_{j=1}^{n} a_{jh} \frac{\partial x_j}{\partial \bar{x}_i}, \tag{22}$$

we can write this law in the form

$$\bar{a}_{ik} = \sum_{h=1}^{n} b_{hi} \frac{\partial x_h}{\partial \bar{x}_k}.$$

This law, which is completely analogous to (21), enables us
to conclude at once, from the example of the preceding case,
that the relation between $\bar{a}$ and the determinant $b$ of the
quantities $b_{hi}$ is analogous to (20), i.e. that

$$\bar{a} = bD. \tag{23}$$

Further, as (22) is of the same type as (21), the determinant
$b$ will be connected with $a$ by the relation

$$b = aD,$$

which, combined with (23), gives us the required relation between
$a$ and $\bar{a}$, namely,

$$\bar{a} = aD^2. \tag{24}$$


It follows from (20) and (24) that the ratio $\nabla / \sqrt{a}$ is an absolute
invariant, i.e. that

$$\frac{\bar{\nabla}}{\sqrt{\bar{a}}} = \frac{\nabla}{\sqrt{a}}.$$

Strictly speaking, this equality holds except for sign; but if
we agree to change the sign of the radical when a transformation
is made for which $D$ is negative, it holds in sign as well as in
numerical value.

The remark just made leads us to define a particularly useful
tensor whose elements can be expressed in a very simple form.

In fact, we note that the quantity $\nabla / \sqrt{a}$, which we have just
seen to be invariant, is merely a multilinear form in the $n$ sets
of variables $\lambda_{\alpha|i}$; this is seen at once by expanding the
determinant $\nabla$ in the usual way, as the sum of products of its elements
$n$ at a time, where no product contains two elements from the
same row or column, and with the usual rule as to sign. We
may write the result in the form

$$\frac{\nabla}{\sqrt{a}} = \frac{1}{\sqrt{a}} \sum \pm\, \lambda_{1|i_1} \lambda_{2|i_2} \dots \lambda_{n|i_n},$$

where the symbol $\sum$ denotes the sum of all the possible
products, subject to the conventions stated as to their structure
and sign. Since this form is invariant, its coefficients constitute
a contravariant system. If we put $\epsilon^{i_1 i_2 \dots i_n}$ for the coefficient of
the product $\lambda_{1|i_1} \lambda_{2|i_2} \dots \lambda_{n|i_n}$, we see at once that we have:

$\epsilon^{i_1 i_2 \dots i_n} = 0$ if at least two of the indices are
equal;

$\epsilon^{i_1 i_2 \dots i_n} = \dfrac{1}{\sqrt{a}}$ if these indices are all different and
constitute a permutation of even order with respect to the
fundamental permutation $1, 2, \dots n$;

$\epsilon^{i_1 i_2 \dots i_n} = -\dfrac{1}{\sqrt{a}}$ if the indices are all different and constitute
a permutation of odd order.

Hence it follows that the system of order $n$ whose elements
are $0$, $\dfrac{1}{\sqrt{a}}$, $-\dfrac{1}{\sqrt{a}}$, respectively according to the rules just stated,
is contravariant; we shall call it the contravariant ε-system.

We can give an analogous definition of the covariant ε-system
by considering the determinant (reciprocal to $\nabla$)

$$\Delta = \| \lambda_\alpha^i \|$$

constructed from the reciprocal elements $\lambda_\alpha^i$ of the systems in
the determinant $\nabla$; these elements, as we know from Chapter
IV, p. 74, constitute a set of $n$ contravariant simple systems. By
a well-known theorem, which can at once be verified, we have

$$\nabla \Delta = 1;$$

hence the quantity $\sqrt{a}\, \Delta$ (the reciprocal of $\nabla / \sqrt{a}$) will be invariant.
Expanding the determinant $\Delta$, this can be written in the form

$$\sqrt{a}\, \Delta = \sum \pm \sqrt{a}\, \lambda_1^{i_1} \lambda_2^{i_2} \dots \lambda_n^{i_n},$$

where the symbol $\sum$ as before denotes the sum for all
permutations of the indices $i$.

It follows that the system $\epsilon_{i_1 i_2 \dots i_n}$ whose elements are zero
if the indices are not all different, and are equal to
$\sqrt{a}$ or $-\sqrt{a}$ if the indices are all different according as the
permutation $i_1 i_2 \dots i_n$ is of even or odd order, is covariant.

The use of the same letter ε for both is justified by the fact
that this covariant system is the reciprocal of the former system.
This statement can easily be verified by the reader.^

By means of the ε-systems, when $n - 1$ vectors $v_\alpha$ ($\alpha = 1,
2, \dots n - 1$) are given, we can deduce from them (by invariant
processes) an $n$th vector $w$, which is called their vector product,
as in three-dimensional Euclidean space it is identical with the
ordinary vector product. If $v_\alpha^i$ and $v_{\alpha|i}$ ($i = 1, 2, \dots n$) denote
the contravariant and covariant components of the $n - 1$ given
vectors, the formulae

$$w_i = \sum_{i_1, \dots, i_{n-1} = 1}^{n} \epsilon_{i\, i_1 i_2 \dots i_{n-1}}\, v_1^{i_1} v_2^{i_2} \dots v_{n-1}^{i_{n-1}},$$

$$w^i = \sum_{i_1, \dots, i_{n-1} = 1}^{n} \epsilon^{i\, i_1 i_2 \dots i_{n-1}}\, v_{1|i_1} v_{2|i_2} \dots v_{n-1|i_{n-1}}$$

define two reciprocal systems, as can easily be verified; hence
either separately defines the same vector, which we call $w$.
When $n = 3$ and the space is Euclidean the components of $w$
do in fact reduce to those of the ordinary vector product.
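
The definition can be exercised with a short sketch. It assumes indices numbered from 0, the free index placed first in the ε-symbol, and Cartesian co-ordinates in Euclidean space when $\sqrt{a} = 1$ is passed, so that covariant and contravariant components coincide; the function names are illustrative only.

```python
from itertools import permutations


def parity(p):
    """Return +1 for a permutation of even order, -1 for one of odd order."""
    inversions = sum(1 for i in range(len(p))
                     for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inversions % 2 else 1


def epsilon_lower(indices, sqrt_a=1.0):
    """Covariant epsilon: 0 for repeated indices, otherwise +sqrt(a) or -sqrt(a)."""
    if len(set(indices)) < len(indices):
        return 0.0
    return parity(indices) * sqrt_a


def vector_product(vectors, sqrt_a=1.0):
    """Covariant components w_i from n - 1 vectors given by contravariant components."""
    n = len(vectors) + 1
    w = [0.0] * n
    # Terms with a repeated index vanish, so only permutations contribute.
    for p in permutations(range(n)):
        term = epsilon_lower(p, sqrt_a)
        for v, index in zip(vectors, p[1:]):
            term *= v[index]
        w[p[0]] += term
    return w
```

For the two vectors (1, 2, 3) and (4, 5, 6) in three dimensions this gives (−3, 6, −3), the ordinary vector product, and the result is orthogonal to both factors. If the given vectors are linearly dependent, all components vanish.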

^ For this and other properties of the ε-systems, cf. an interesting note by
Lipka: "Sui sistemi E nel calcolo differenziale assoluto", in Rend. della R. Acc.
dei Lincei, Vol. XXXI (first half-year, 1922), pp. 242-245.





In any case it follows from the preceding definition of the
components $w_i$ (or $w^i$) that $w = 0$ if the vectors $v_\alpha$ are not all
linearly independent, i.e. if the characteristic of the matrix
composed of their components is $< n - 1$; when they are independent,
$w \neq 0$ and is perpendicular to every $v_\alpha$. The latter property
follows from the consideration of a generic scalar product $w \times v_\alpha$.
Taking, say, the first group of formulas, we have

$$w \times v_\alpha = \sum_{i=1}^{n} w_i v_\alpha^i = \sum_{i, i_1, \dots, i_{n-1} = 1}^{n} \epsilon_{i\, i_1 i_2 \dots i_{n-1}}\, v_\alpha^{i}\, v_1^{i_1} v_2^{i_2} \dots v_{n-1}^{i_{n-1}},$$

which is zero from the definition of the ε-system, or, in other
words, because the sum is the expansion of a determinant with
two rows the same.

Lastly, we wish to introduce into the metric of a $V_n$ the
notion of the extension of a field, i.e. to define, for a given field
of $V_n$, a quantity $V$ analogous to the area of a portion of a
surface or to the volume of a three-dimensional field. Evidently
we have a priori a free choice as to the definition of $dV$, provided
that when $n = 2$ it reduces to the expression already given
for the element of area (formula (17), p. 99), and that when
$n = 3$, in Cartesian co-ordinates, we have $dV = dx\, dy\, dz$;
further, from the geometrical meaning of the term, the extension
$V$ of a field must be an invariant.^ All these conditions are
satisfied if we assume

$$dV = \sqrt{a}\; dx_1\, dx_2 \dots dx_n, \tag{26}$$

where $\sqrt{a}$ denotes the arithmetical value of the radical, and
therefore

$$V = \int \sqrt{a}\; dx_1\, dx_2 \dots dx_n.$$

We know in fact that on a change of co-ordinates the product
$dx_1\, dx_2 \dots dx_n$ must be replaced by $|D|\, d\bar{x}_1\, d\bar{x}_2 \dots d\bar{x}_n$.
From (24), extracting the square root, and taking the absolute
values of both sides of the equation, we get

$$\sqrt{a}\; |D|\; d\bar{x}_1\, d\bar{x}_2 \dots d\bar{x}_n = \sqrt{\bar{a}}\; d\bar{x}_1\, d\bar{x}_2 \dots d\bar{x}_n.$$

But the left-hand side is $\sqrt{a}\; dx_1\, dx_2 \dots dx_n$, which is
therefore invariant.

^ A detailed study of the concept of extension and of its analytical expression has
recently been made by O. Hölder. Cf. "Das Volumen in einer Riemann'schen
Mannigfaltigkeit", in Math. Zeitschrift, Vol. 20 (1924), pp. 7-20.
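
A hedged numerical illustration of (24) and (26) follows. It assumes the change from Cartesian co-ordinates $(x, y)$, for which $a = 1$, to polar co-ordinates $(r, \theta)$, and computes the area of a disc by a midpoint sum; the grid sizes and function names are arbitrary choices made for the example.

```python
import math


def jacobian(r, theta):
    """Matrix with entries dx_j / d(xbar_i) for x = r cos(theta), y = r sin(theta)."""
    return [[math.cos(theta), -r * math.sin(theta)],
            [math.sin(theta), r * math.cos(theta)]]


def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def transformed_metric(r, theta):
    """Coefficients abar_ik in polar co-ordinates, starting from a_jh = delta_jh."""
    jac = jacobian(r, theta)
    return [[sum(jac[j][i] * jac[j][k] for j in range(2)) for k in range(2)]
            for i in range(2)]


def disc_area(radius, nr=200, nt=200):
    """Extension of a disc as the midpoint sum of sqrt(abar) dr dtheta."""
    dr = radius / nr
    dt = 2 * math.pi / nt
    total = 0.0
    for p in range(nr):
        r = (p + 0.5) * dr
        for q in range(nt):
            theta = (q + 0.5) * dt
            total += math.sqrt(det2(transformed_metric(r, theta))) * dr * dt
    return total
```

Here the determinant of `transformed_metric` equals the square of the Jacobian determinant, as (24) requires with $a = 1$, and `disc_area(R)` agrees with $\pi R^2$, the value obtained in Cartesian co-ordinates.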


9. Rotor of a simple tensor in three dimensions.

We can now give a definition of the rotor (or rotation, or
curl) of a vector $X$ given as a function of position, which shall
hold good both when the space considered is not Euclidean,
and also when it is Euclidean but the co-ordinates are not
Cartesian. For any value of $n$, the generalization consists in defining
as the rotor the covariant double system

$$p_{il} = X_{i|l} - X_{l|i},$$

which is obviously antisymmetrical, since $p_{il} + p_{li} = 0$
identically. As we saw in § 2, the $p$'s can also be written as the
differences of the ordinary derivatives $\dfrac{\partial X_i}{\partial x_l} - \dfrac{\partial X_l}{\partial x_i}$; if then we
consider the $X$'s as coefficients of a Pfaffian

$$\psi = \sum_{i=1}^{n} X_i\, dx_i,$$

it will be seen that the $p$'s are merely the coefficients of the
bilinear covariant of this Pfaffian (cf. Chapter II, p. 20).

To get the full analogy to the ordinary rotor, however, we
should consider a space of only three dimensions. For $n = 3$,
there are three different elements $p_{il} = -p_{li}$, corresponding
to the pairs of different suffixes 23, 31, 12, pairs of equal suffixes
giving zero values of the $p$'s. Each of the pairs 23, 31, 12, can be
associated with the absent suffix (1, 2, or 3 respectively), or,
in a general formula, the index $h$ can be associated with the
pair of the type $h + 1$, $h + 2$, with the convention that suffixes
which differ by 3 are to be considered equivalent; for instance,
if $h = 2$, $h + 2$ represents the suffix 1. It is therefore easy to
understand how when $n = 3$ the rotor can be represented by
a simple instead of a double system. If, however, we were to
put

$$p_h = p_{h+1,\, h+2},$$

the simple system so defined would be neither covariant nor
contravariant. Instead, it will be convenient to apply the term
rotor to a vector $R$ whose contravariant components $R^h$ are
defined as follows (with the help of the ε-systems introduced in
the preceding section):

$$R^h = \frac{1}{2} \sum_{i,l=1}^{3} \epsilon^{hil}\, p_{li} \qquad (h = 1, 2, 3).$$

The contravariance of $R^h$ follows immediately from the
principle of contraction. In order to see the analogy between this
expression and the ordinary rotor, note that in the double sum
$i$ and $l$ can take only the values $h + 1$, $h + 2$ (since the ε
corresponding to the value $h$ would be zero); since $i$ and $l$ must also
be unequal, there are two possible cases:

$i = h + 1$, $l = h + 2$, when $\epsilon^{hil} = \dfrac{1}{\sqrt{a}}$;

$i = h + 2$, $l = h + 1$, when $\epsilon^{hil} = -\dfrac{1}{\sqrt{a}}$.

Hence this sum will have only two terms, and $R^h$ can be
written in the following form:

$$R^h = \frac{1}{2\sqrt{a}} \left( p_{h+2,\, h+1} - p_{h+1,\, h+2} \right) = \frac{1}{\sqrt{a}}\, p_{h+2,\, h+1},$$

or

$$R^h = \frac{1}{\sqrt{a}} \left( \frac{\partial X_{h+2}}{\partial x_{h+1}} - \frac{\partial X_{h+1}}{\partial x_{h+2}} \right),$$

the latter being convenient for actual calculations. In Cartesian
co-ordinates $a = 1$, and we get the ordinary expression for the
components of a rotor (it being supposed that $x_1$, $x_2$, $x_3$
correspond in order with $x$, $y$, $z$).


10. Sections of a manifold. Geodesic manifolds.

We know that in ordinary space if we are given two
directions $\lambda$, $\mu$ starting from the same point $P$ and defined by their
cosines $\lambda^i$, $\mu^i$ ($i = 1, 2, 3$), every other direction $\xi$ through
$P$ whose cosines are linear combinations of those of $\lambda$ and $\mu$,
i.e. $\xi^i = \rho \lambda^i + \sigma \mu^i$, lies in the plane determined by $\lambda$ and $\mu$.

The coefficients $\rho$ and $\sigma$ are of course not independent, as the
$\xi$'s must satisfy a quadratic identity; we have in fact

$$\rho^2 + \sigma^2 + 2 \rho \sigma \cos \widehat{\lambda\mu} = 1.$$

The directions $\xi$ so defined are therefore simply infinite in
number, and their aggregate is called a section.

All this can easily be extended to a generic $V_n$, in which
$m$ directions $\lambda_\alpha$ ($\alpha = 1, 2, \dots m$) are given.

Take $m$ multipliers $\rho_\alpha$, for the moment arbitrary, and
consider the directions $\xi$ whose parameters are

$$\xi^i = \sum_{\alpha=1}^{m} \rho_\alpha \lambda_\alpha^i, \tag{27}$$

and consequently whose moments are

$$\xi_i = \sum_{\alpha=1}^{m} \rho_\alpha \lambda_{\alpha|i}. \tag{27'}$$

In order that these expressions may effectively represent
parameters and moments respectively, it is necessary and
sufficient that they should satisfy the relation

$$\sum_{i=1}^{n} \xi^i \xi_i = 1,$$

that is to say,

$$\sum_{\alpha,\beta=1}^{m} \rho_\alpha \rho_\beta \sum_{i=1}^{n} \lambda_\alpha^i \lambda_{\beta|i} = 1,$$

or, denoting the angle between the directions $\lambda_\alpha$ and $\lambda_\beta$ by $\widehat{\alpha\beta}$,

$$\sum_{\alpha,\beta=1}^{m} \rho_\alpha \rho_\beta \cos \widehat{\alpha\beta} = 1. \tag{28}$$

Now suppose that the $\rho$'s are connected by this relation but
are otherwise arbitrary. We then see that (27) (or (27')) defines
an aggregate of $\infty^{m-1}$ directions (this being the number of
arbitrary parameters), including in particular the $m$ given directions;
this aggregate is called a section.
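
As a minimal sketch of (27) and (28), the following takes arbitrary multipliers, rescales them so that the quadratic relation holds, and returns the parameters of the resulting direction. It assumes a positive definite form (so that the quantity under the root is positive) and directions supplied by their parameters; the function names are hypothetical.

```python
def inner(a, u, v):
    """Scalar product sum over i, k of a[i][k] * u[i] * v[k]."""
    n = len(u)
    return sum(a[i][k] * u[i] * v[k] for i in range(n) for k in range(n))


def section_direction(a, directions, rhos):
    """Parameters xi^i of the direction of the section fixed by the multipliers.

    The multipliers are rescaled by a common factor so that relation (28)
    holds; inner(a, directions[p], directions[s]) plays the part of the
    cosine of the angle between two of the given unit directions.
    """
    m = len(directions)
    q = sum(rhos[p] * rhos[s] * inner(a, directions[p], directions[s])
            for p in range(m) for s in range(m))
    if q <= 0:
        raise ValueError("multipliers do not define a direction")
    scale = q ** -0.5
    n = len(directions[0])
    return [scale * sum(rhos[p] * directions[p][i] for p in range(m))
            for i in range(n)]
```

With the Euclidean form in three dimensions, the two directions (1, 0, 0) and (0, 1, 0), and multipliers 3 and 4, the function returns approximately (0.6, 0.8, 0), whose parameters satisfy the quadratic identity with value 1.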

A section $G$ being defined in this way by means of $m$ of its
directions $\lambda_\alpha$, take in it any $m$ directions $\lambda'_\alpha$ whatever ($\alpha = 1,
2, \dots m$). It is almost obvious that the section $G'$ determined
by these directions is again $G$ itself.

This can of course be verified algebraically. In fact, if a
direction $\xi$ belongs to $G'$, this is equivalent to saying that its
parameters are linear combinations of the parameters $\lambda'^{\,i}_\alpha$ and
therefore also of the parameters $\lambda_\alpha^i$, i.e. the direction $\xi$ also belongs
to $G$, and vice versa.
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We saw in Chapter Vjp. 130, that a geodesic is uniquely deter- 
mined if its starting-point and direction are given. Now let us 
fix a point P in a and draw from it two directions X, (x; 
these will determine a section of cx)^ directions drawn from P, 
Consider the go^ geodesics drawn from P in all these directions: 
they constitute a surface (oo^ points) which is called a geodesic 
surface with pole P. 

A geodesic surface is therefore determined by a point and 
two directions. 

A similar definition can be given of an m-dimensional geodesic
manifold. Take a point in $V_n$, and m directions drawn from it,
which will define a section of $\infty^{m-1}$ directions, and construct the
geodesic corresponding to each of these directions. Since each
geodesic contains $\infty^1$ points, the aggregate of all of them will
contain $\infty^m$ points; i.e. it will constitute a manifold $V_m$ which
we shall call a geodesic manifold.

Particularly important cases are the geodesic surfaces (m = 2),
and the geodesic hypersurfaces (m = n - 1) determined by n - 1
directions drawn from a point; we shall use these in the following
section.

11. Locally geodesic (or locally Cartesian) co-ordinates. 

In general, a system of co-ordinates in which $ds^2$ is represented
by a form with constant coefficients is called Cartesian.
It is not always possible to choose co-ordinates of this kind in a
given $V_n$; it is however always possible to find a system of
co-ordinates which behave like Cartesians in the immediate vicinity
of a point P assigned beforehand, or, more precisely, which are such
that the derivatives of the coefficients of $ds^2$ (which would vanish
identically if the co-ordinates were Cartesian) all vanish at the
point P. Such co-ordinates are called locally geodesic, or locally
Cartesian, co-ordinates.

Their interest from the point of view of parallelism, or more
generally of elementary equipollent displacement, appears plainly
from equations (52) and (52') of Chapter V, pp. 138, 139, which
define the increments of the contravariant and covariant components
respectively. It follows from these equations that when
the system of reference is geodesic at P, these increments, in
passing to any very near point, are zero, precisely as are those
of the ordinary Cartesian components in Euclidean spaces.
Now take a $V_n$, and in it any system of co-ordinates x; we
propose to introduce (if this is possible) a new set of variables

$$\bar{x}_i = f_i(x_1, x_2, \ldots x_n) \quad (i = 1, 2, \ldots n) \tag{29}$$

such that the $\bar{x}$’s are geodesic co-ordinates at P, or in other words,
putting $\bar{a}_{ik}$ for the coefficients of $ds^2$ in the new variables, such
that

$$\left(\frac{\partial \bar{a}_{ik}}{\partial \bar{x}_l}\right)_P = 0 \quad (i, k, l = 1, 2, \ldots n), \tag{30}$$

where the use of the suffix P denotes that after differentiation
the $\bar{x}$’s are to be replaced by the co-ordinates of P. Remembering
the definition of Christoffel’s symbols (Chapter V, pp. 109,
110), we see that (30) is equivalent to the condition that these
symbols themselves are all zero at P, i.e. that

$$\overline{\{jl, i\}}_P = 0 \quad (j, l, i = 1, 2, \ldots n). \tag{30'}$$

The following analysis shows the possibility of finding a
set of functions $f_i$ to define a transformation of this kind.

The condition (30) consists of $n \cdot \tfrac{1}{2}n(n+1)$ equations containing
the first and second derivatives of the f’s (since $\bar{a}_{ik}$, by
the law of covariance, can be expressed in terms of the $a_{ik}$
and the first derivatives of the f’s). Now the number of first
derivatives is $n^2$ and that of second derivatives is $n \cdot \tfrac{1}{2}n(n+1)$,
so that the number of both together is greater than the number
of equations. Since, as we shall see, the equations are not
algebraically inconsistent, it follows that we can solve the
equations (30) for the values at P of the first and second
derivatives of the f’s, or rather for some of them, the others
remaining arbitrary; further, the behaviour of the functions
at points other than P is a matter of indifference. Thus
the choice of the f’s can be made with a wide degree of
arbitrariness.
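
The counting argument above can be tabulated for small n. The sketch below is only an illustration of that count: the function name `count_conditions` is introduced here and is not part of the text, and it assumes that the equations number n times n(n+1)/2, the first derivatives n squared, and the second derivatives n times n(n+1)/2.

```python
def count_conditions(n):
    """Count the equations (30) and the unknown derivative values at P."""
    # One equation for each of the n derivatives of each of the
    # n(n+1)/2 independent (symmetric) coefficients of ds^2.
    equations = n * (n * (n + 1) // 2)
    # Unknowns: the n^2 first derivatives of the f's ...
    first_derivatives = n * n
    # ... and their second derivatives, symmetric in the two
    # differentiation indices, hence n(n+1)/2 for each of the n functions.
    second_derivatives = n * (n * (n + 1) // 2)
    return equations, first_derivatives + second_derivatives


for n in (2, 3, 4):
    equations, unknowns = count_conditions(n)
    # The surplus of unknowns over equations is always n^2.
    print(n, equations, unknowns, unknowns - equations)
```

On this count the unknowns always exceed the equations by exactly the number of first derivatives, which is why some of the values may be left arbitrary.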

To avoid, however, the direct discussion of the equations (30), 
we shall start from the ideas contained in § 26 of Chapter V, 
p. 138. We saw there that the expressions 
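
$$\tau^i = dR^i + \sum_{j,l=1}^{n} \{jl, i\}\, R^j\, dx_l$$

constitute a contravariant simple tensor, the vector $\mathbf{R}$ and
therefore its contravariant components $R^i$, and also the differentials
$dx_i$, being all arbitrary. This holds in particular for the
hypothesis $R^i = dx_i$, i.e. when we suppose

$$\tau^i = d^2x_i + \sum_{j,l=1}^{n} \{jl, i\}\, dx_j\, dx_l. \tag{31}$$

If on changing the variables we have at a special point P

$$\frac{\partial \bar{x}_i}{\partial x_k} = \delta_i^k \tag{32}$$

(where $\delta_i^k$ is 1 if i = k and 0 otherwise), then at that point,
from the law of contravariance

$$\bar{\tau}^i = \sum_{k=1}^{n} \tau^k \frac{\partial \bar{x}_i}{\partial x_k},$$

it follows that

$$\bar{\tau}^i = \tau^i. \tag{33}$$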


If we suppose (as we are always free to do, by making a
preliminary change of variables from $x_i$ to $x_i$ + a constant)
that the x’s vanish at P, the equations (32) are satisfied provided
the formulae of transformation (29) are of the type

$$\bar{x}_i = x_i + \varphi_i(x_1, x_2, \ldots x_n), \tag{29'}$$

where $\varphi_i$ denotes a function of the x’s which is regular at P
and whose expansion in a series of powers of the x’s begins with
terms of at least the second degree, e.g. a polynomial of the
second degree in the x’s. In fact, if these conditions are fulfilled,
all the first derivatives of the $\varphi$’s vanish at P. The second
derivatives $\partial^2 \varphi_i / \partial x_j \partial x_l$ are identical with the second derivatives
$\partial^2 \bar{x}_i / \partial x_j \partial x_l$ and give the terms of the second degree (by Maclaurin’s
theorem) on the right-hand side of the equations (29'). By a
suitable choice of the numerical values of these second derivatives
at P, we can make all the Christoffel’s symbols for the variables
$\bar{x}$ vanish, so satisfying the equations (30'), as we shall now show.

In fact, writing out both sides of equation (33) in full by
means of (31), and considering the x’s, in virtue of (29'), as
independent variables (with their second differentials zero) and
the $\bar{x}$’s as functions of them, we can write (33) in the form

$$d^2\bar{x}_i + \sum_{h,k=1}^{n} \overline{\{hk, i\}}\, d\bar{x}_h\, d\bar{x}_k = \sum_{j,l=1}^{n} \{jl, i\}\, dx_j\, dx_l.$$

Equating the coefficients of $dx_j\, dx_l$ on both sides and remembering
(32) we get

$$\frac{\partial^2 \bar{x}_i}{\partial x_j \partial x_l} + \overline{\{jl, i\}}_P = \{jl, i\}_P,$$

from which it appears that we need only take

$$\frac{\partial^2 \bar{x}_i}{\partial x_j \partial x_l} = \{jl, i\}_P$$

at P in order to have

$$\overline{\{jl, i\}}_P = 0$$

for every possible set of values of j, l, i.

Q.E.D.¹
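
One explicit choice consistent with this argument can be written down directly. It is offered here as an illustrative sketch, not as a formula quoted from the text: it assumes that the x’s vanish at P, that $\{jl, i\}_P$ denotes Christoffel’s symbols of the original co-ordinates evaluated at P, and that the relation used at P is $\partial^2 \bar{x}_i / \partial x_j \partial x_l + \overline{\{jl, i\}}_P = \{jl, i\}_P$.

```latex
% Sketch: a homogeneous quadratic correction with coefficients \{jl, i\}_P.
\bar{x}_i = x_i + \tfrac{1}{2} \sum_{j,l=1}^{n} \{jl, i\}_P \, x_j x_l
\quad\Longrightarrow\quad
\left(\frac{\partial \bar{x}_i}{\partial x_k}\right)_P = \delta_i^k,
\qquad
\left(\frac{\partial^2 \bar{x}_i}{\partial x_j \partial x_l}\right)_P = \{jl, i\}_P .
```

The factor 1/2 is compensated by the symmetry of $\{jl, i\}$ in j and l, so the second derivatives at P take exactly the values required for the transformed symbols to vanish there.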


¹ Prof. Fermi has recently established an important extension of this result by
showing that, given any curve whatever, it is also possible to choose co-ordinates
which are locally geodesic at every point of the curve. Cf. his notes “Sopra
i fenomeni che avvengono in prossimità di una linea oraria” in Rend. della R. Acc.
dei Lincei, Vol. XXXI (first half-year, 1922), pp. 21-23, 51-52. Fermi’s result
can be quickly justified as follows, by calculating the number of available unknowns
and of conditions to be satisfied.

Take the equations of the curve L in the form

$$x_i = x_i(x_n) \quad (i = 1, 2, \ldots n-1),$$

as we may always do by considering a suitably limited segment. Note first that
if the values $z_L$ of a generic function $z(x_1, x_2, \ldots x_n)$ and of its partial derivatives
with respect to $x_1, x_2, \ldots x_{n-1}$ are known at all points of the curve, then the values
of $\partial z / \partial x_n$ also are determined at all points of the curve. This is obvious if we take
the identity $z(x_1, x_2, \ldots x_n) = z_L(x_n)$ which holds at all points of L, and
differentiate it, so getting

$$\frac{\partial z}{\partial x_n} = \frac{dz_L}{dx_n} - \sum_{i=1}^{n-1} \frac{\partial z}{\partial x_i} \frac{dx_i}{dx_n}.$$

Now suppose that we make a change of variables of the general type (29), and
that we wish to determine, if possible, the n functions $f_i(x_1, x_2, \ldots x_n)$ so as to
make every $\partial \bar{a}_{ik} / \partial \bar{x}_l = 0$ along L. As has already been noted in dealing with a single
point P, we thus get $n \cdot \tfrac{1}{2}n(n+1)$ conditions involving the first and second derivatives
of the f’s. Now the number of first derivatives is $n^2$, and that of the
second derivatives $\partial^2 f_i / \partial x_h \partial x_k$ is $n \cdot \tfrac{1}{2}n(n+1)$; but from the preceding remark, the
$n^2$ of the latter which are of the type $\partial^2 f_i / \partial x_n \partial x_k$ (i, k = 1, 2, ... n) can be expressed
at points of L in terms of the others and of the first derivatives. There remain
altogether, including both first and second derivatives,
$n^2 + [\,n \cdot \tfrac{1}{2}n(n+1) - n^2\,] = n \cdot \tfrac{1}{2}n(n+1)$ unknown functions of $x_n$ to determine by means
of the same number of equations $\partial \bar{a}_{ik} / \partial \bar{x}_l = 0$. These last equations, as can at once
be seen, contain the second derivatives $\partial^2 f_i / \partial x_h \partial x_k$ (h, k < n) in finite terms (in fact
linearly), while the unknown values of the first derivatives $\partial f_i / \partial x_h$ appear together
with the terms $\frac{d}{dx_n} \frac{\partial f_i}{\partial x_h}$. In any case we have a system of as many equations as there are
unknown functions of $x_n$ alone to determine. When the values of these derivatives
are known on L, we can determine, with a wide degree of arbitrariness, functions
$f_i$ which admit of these values. This can be seen by taking a Taylor expansion
of the f’s as a function of the n - 1 arguments $x_1 - x_1^0,\ x_2 - x_2^0,\ \ldots\ x_{n-1} - x_{n-1}^0$,
where the quantities $x_i^0$ (i = 1, 2, ... n - 1) are the values of the $x_i$ on the curve,
i.e. the functions $x_i(x_n)$ which define this curve.

It is not inapposite to give a geometrical interpretation of
the conditions imposed on the co-ordinates x in order that (30')
may be satisfied, or, in other words, in order that they may be
geodesic at P. These conditions may be put in the following
form:

(a) The n co-ordinate hypersurfaces passing through P must
behave as geodesic hypersurfaces with respect to points infinitely
near P (or, in particular, must be geodesic everywhere).

(b) If through a point P', infinitely near to P and on one of
the n co-ordinate lines through P (say that along which $x_i$
alone varies), we construct the direction parallel to another of
the co-ordinate lines, this parallel must belong to the co-ordinate
hypersurface $x_i$ = constant which passes through P'.

(c) When the co-ordinate hypersurfaces are fixed in accordance
with the foregoing conditions (which, as is geometrically obvious,
is always possible), the numbering of these surfaces (i.e. the way
in which they are associated with the values of the parameters
$x_1, x_2, \ldots x_n$) must be carried out so as to satisfy certain numerical
conditions which we shall subsequently specify, and which, as
we shall see, can always be satisfied.

That (a) and (b) are consequences of (30') follows immediately
from the equations of parallelism and of geodesics. Reciprocally,
we shall show that a system of co-ordinates which satisfies
the conditions (a), (b), (c) is geodesic at P.

We shall begin by expressing the condition (a) analytically.
Take a direction with parameters $dx_j$ drawn from P and lying
in the hypersurface $x_i$ = constant (so that $dx_i = 0$). We have
to express the fact that the geodesic in this direction behaves at
P as if it lay on this hypersurface, i.e. that $d^2x_i$ vanishes along
this geodesic. It follows that $dx_i = d^2x_i = 0$, and therefore,
from the equation of geodesics,

$$\sum_{j,l=1}^{n} \{jl, i\}_P\, dx_j\, dx_l = 0.$$

Of the terms in this sum, those in which either j or l, or
both, are equal to i vanish, since $dx_i = 0$; the other dx’s being
arbitrary, the necessary condition for the vanishing of the other
terms is that

$$\{jl, i\}_P = 0 \quad (j \neq i,\ l \neq i).$$

We thus see the analytical meaning of condition (a).

Next consider (b). We shall take P' on the line i, so that, if
dx represents the increments of the co-ordinates from P to P',
we shall have $dx_l = 0$ for every value of l other than i. Let
$\lambda$ denote the direction of the co-ordinate line j at P, so that
$\lambda^k = 0$ for every k other than j, and let $\lambda$ undergo a parallel
displacement from P to P'. Applying the usual formula and
remembering that $dx_i$ and $\lambda^j$ are the only components which are
not zero, we get (without summation)

$$d\lambda^i = -\{ji, i\}_P\, \lambda^j\, dx_i.$$

In order that the direction $\lambda' = \lambda + d\lambda$ may lie on the
hypersurface $x_i$ = constant, we must have $\lambda'^i = 0$, or (since,
as we noted, $\lambda^i = 0$ if $i \neq j$) $d\lambda^i = 0$ if $i \neq j$, so that we must
have

$$\{ji, i\}_P = 0 \quad (i \neq j).$$

This is the analytical expression of condition (b). We must
now use the third condition in order to show that the symbols
with three equal indices vanish; we shall thus have exhausted
all the types of Christoffel’s symbols.

Suppose that the co-ordinates x satisfy the foregoing conditions.
Apply a transformation which leaves the co-ordinate
surfaces unchanged; this can be done by putting $x_i = f_i(\bar{x}_i)$
(i.e. every x is a function of a single $\bar{x}$), or, which is the same
thing,

$$dx_i = X_i(\bar{x}_i)\, d\bar{x}_i,$$
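where $X_i$ denotes the derivative of $f_i$ with respect to its
argument.

We shall now calculate the explicit expression of the symbols
which we intend shall vanish. We have

$$\overline{[ii, i]} = \sum_{l=1}^{n} \bar{a}_{il}\, \overline{\{ii, l\}},$$

or, remembering that all the symbols are already zero except
those with three equal indices,

$$\overline{[ii, i]} = \bar{a}_{ii}\, \overline{\{ii, i\}}.$$

Substituting on the left-hand side the expression which defines
the symbol of the first kind, we get

$$\frac{1}{2} \frac{\partial \bar{a}_{ii}}{\partial \bar{x}_i} = \bar{a}_{ii}\, \overline{\{ii, i\}}.$$

Hence the condition

$$\overline{\{ii, i\}} = 0 \quad (i = 1, 2, \ldots n)$$

is equivalent to

$$\frac{\partial \bar{a}_{ii}}{\partial \bar{x}_i} = 0.$$

Now from the law of covariance we have

$$\bar{a}_{ii} = a_{ii} X_i^2,$$

and therefore

$$\frac{\partial \bar{a}_{ii}}{\partial \bar{x}_i} = \frac{\partial a_{ii}}{\partial x_i} X_i^3 + 2 a_{ii} X_i X_i'.$$

In order that the required condition may be satisfied, the
functions X therefore need only satisfy, at P, the n numerical
conditions

$$\frac{\partial a_{ii}}{\partial x_i} X_i^3 + 2 a_{ii} X_i X_i' = 0;$$

otherwise they may be completely arbitrary.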

We thus see how to determine a system of co-ordinates x 
which shall be locally geodesic at P. 
12. Severi’s theorem.

The possibility of choosing co-ordinates which are locally
Cartesian at a given point enables us to simplify the proof of
some geometrical properties which hold in the neighbourhood of
a point. As an example we shall prove, without any calculation,
a remarkable theorem due to Professor Severi.¹

In a given $V_n$ consider two infinitely near points, P and $P_1$,
and a direction u drawn from P. This direction and the direction
$PP_1$ determine a section of $V_n$, and therefore a geodesic
surface $V_2$ which passes through P and $P_1$ and contains u.

We can now give u a parallel displacement, from P to $P_1$,
in two ways:

(1) by considering u as a direction in $V_n$, and therefore using
the metric of this variety; this will give a direction $u_1$, which
we shall call the ambiental parallel;

(2) by considering u as a surface direction, belonging to the
geodesic surface $V_2$ just defined, and using the metric of $V_2$;
this will give a direction $u'_1$.

Severi’s theorem is that $u_1$ and $u'_1$ are identical.

We shall examine first the case in which $V_n$ is Euclidean.
In this case the geodesics are straight lines (since, with a system
of Cartesian co-ordinates y, Christoffel’s symbols are zero and
the equations of the geodesics become $d^2y_i/ds^2 = 0$ (i = 1, 2, ... n))
and the geodesic surfaces are planes; Severi’s theorem becomes
an immediate consequence of the ordinary theory of parallelism
in Euclidean spaces.

Next, if $V_n$ is not Euclidean, we note that in the definitions
of the ambiental parallel $u_1$, the geodesic surface $V_2$, and the
parallel $u'_1$ relative to $V_2$, the only metrical elements used are
Christoffel’s symbols for the $V_n$; since all these can be made
to vanish by a suitable choice of co-ordinates, the two methods
of displacement are applied exactly as if $V_n$ were Euclidean, and
therefore lead to the same result.

¹ “Sulla curvatura delle superficie e varietà” in Rend. del Circolo Mat. di
Palermo, Vol. XLII, 1917, pp. 227-259. Cf. especially § 11.
CHAPTER VII

Riemann’s Symbols and Properties relating to Curvature;
Ricci’s and Einstein’s Symbols; Geodesic Deviation

1. Cyclic displacement and the relations between parallelism
and curvature.

Schouten,¹ by his vector methods, and independently of him
Pérès,² by ordinary calculus methods, have demonstrated the
great importance, for determining the geometrical properties of
a $V_n$, of the displacement of a direction round a closed circuit;
in particular the importance of infinitesimal circuits in investigating
local properties at a generic point P.

Consider a generic direction (a unit vector) u drawn from P,
and give it a parallel displacement round a closed curve of
infinitesimal length so that it comes back again to P; after the
displacement we shall have a direction $u_1$, also drawn from P,
but not in general coinciding with u. The change in the contravariant
components due to the displacement round the circuit
will in general depend on the area of the circuit, on its configuration
(i.e. on the orientation in $V_n$ of the element of surface on
which the circuit is drawn), and on the metrical properties of the
$V_n$ at P. The influence of the last-named properties is exerted
through the first and second derivatives of the $a_{ik}$’s; these
derivatives occur in certain characteristic groups which are
called Riemann’s symbols, and which are composed of Christoffel’s
symbols and their first derivatives. In the particular case of a
surface, these expressions reduce to one, which is that known
in geometry as the (Gaussian) curvature of the surface; for any
$V_n$ the consideration of Riemann’s symbols provides a convenient
way of extending the notion of curvature.

In this chapter we shall first consider displacement round a 
particular form of infinitesimal circuit, namely, an elementary 
parallelogram. We shall then discuss some of the properties of 
Riemann’s symbols, which occur in the investigation of the 

^“Die direkte AnalyKsis zur iieuuren Relativitktstlieorie in Verh. dcr Ktm, 
Ah, van Wat, te Amsterdam^ Deel 12, No. 6, 1019. Cf. also the same writer’s Der 
JHcai-KalkUl (Berlin, Springer, 1924), II, §§ 12*16. 

“ “ Le paralkUiame de M, Levi-Civita et la courburo rieraannieiine ”, in Rend, 
dtUa R, Ace, dei Lined, (6), Vul. XXVII (hrat half-year, 1919), pp. 42fi-42S. 
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displacement, and shall use these properties to obtain the formula 
for changing the order of two successive covariant differentiations, 
by determining the difference between the derivatives. Lastly, 
we shall return to the question of displacement round any circuit 
whatever, and shall deduce from it the notion of curvature, 
first for a surface, then for any whatever. 

2. Cyclic displacement round an elementary parallelogram. 

Let two elementary vectors, $\delta P$, $\delta' P$, be drawn from a generic
point P of a $V_n$. We shall interpret the first as an elementary
displacement $PP_1$, and give the vector $\delta' P$ a parallel displacement
along it; let Q be the position of the extremity of $\delta' P$ after
this displacement. If we apply the same process to $\delta P$, and give
it a parallel displacement along the path $PP_2$, we reach the same
point Q (as we have already shown in Chapter V, p. 110), even if
we retain terms of the type $\delta\delta' P$, $\delta'\delta P$, while neglecting terms
of the second order in $\delta P$ and $\delta' P$. We can therefore, in any $V_n$,
consider an elementary parallelogram $PP_1QP_2$.

We shall adopt the obvious convention of representing by
$\delta q$ the change in any quantity q (scalar or vector) in passing
from P to $P_1$, and by $\delta' q$ the analogous change in passing from
P to $P_2$. For a vector, we shall calculate these changes by the
formulae of parallelism.

Now let $D_1 q$ be the change in q on passing from P to Q
along the path $PP_1Q$, and $D_2 q$ the analogous change on passing
along the other pair of sides $PP_2Q$ which with the first pair
make up the circuit.

It will be seen at once that (neglecting second order terms as
explained above) the total change $\Delta q$ on going round the entire
circuit in the sense $PP_1QP_2P$ is $D_1 q - D_2 q$. We shall first
examine $D_1 q$.

The change denoted by $\delta$ corresponds to the displacement
along $PP_1$; hence, if the value of our quantity was q at P, its
value at $P_1$ will be

$$q + \delta q = q_1.$$

The displacement along $P_1Q$ changes $q_1$ into $q_1 + \delta' q_1$, so
that at Q we shall have the quantity

$$q + \delta q + \delta'(q + \delta q) = q + \delta q + \delta' q + \delta'\delta q,$$

so that

$$D_1 q = \delta q + \delta' q + \delta'\delta q.$$

As $D_2 q$, by its definition, differs from $D_1 q$ only by interchanging
$P_1$ and $P_2$, and therefore $\delta$ and $\delta'$, we get

$$D_2 q = \delta' q + \delta q + \delta\delta' q.$$

It follows that the change caused by the displacement round
the circuit is

$$\Delta q = D_1 q - D_2 q = (\delta'\delta - \delta\delta')\, q. \tag{1}$$
We must now find an explicit form for this expression, supposing
that the quantity q is a vector u, and calculating the
increments $\delta$ and $\delta'$ by the formulae of parallelism. By these
formulae, the changes $\delta u^r$ of the contravariant components will
be given by the Pfaffian (p. 138, equation (52))

$$\delta u^r = -\sum_{i,h=1}^{n} \{ih, r\}\, u^i\, \delta x_h, \tag{2}$$

while the changes $\delta' u^r$ will be given by the same Pfaffian relative
to the increments $\delta' x_h$. From (1) we see that we have to calculate
the bilinear covariant (cf. p. 20, § 4) of this Pfaffian.
Differentiating (2) with the symbol $\delta'$ we get

$$\delta'\delta u^r = -\sum_{i,h} \delta'\{ih, r\}\, u^i\, \delta x_h - \sum_{i,h} \{ih, r\}\, \delta' u^i\, \delta x_h - \sum_{i,h} \{ih, r\}\, u^i\, \delta'\delta x_h.$$

To expand the first sum, we note that the expressions $\{ih, r\}$
are functions of the x’s, and therefore

$$\delta'\{ih, r\} = \sum_{k=1}^{n} \frac{\partial \{ih, r\}}{\partial x_k}\, \delta' x_k.$$

The second, on substituting for $\delta' u^i$ the expression analogous
to (2), becomes

$$\sum_{i,h,k,l} \{ih, r\}\{lk, i\}\, u^l\, \delta' x_k\, \delta x_h,$$

or, interchanging i and l in order to get the factor $u^i$ here too,

$$\sum_{i,h,k,l} \{lh, r\}\{ik, l\}\, u^i\, \delta' x_k\, \delta x_h.$$
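We have, therefore, taking out the factors $u^i\, \delta x_h\, \delta' x_k$ in the first
two sums,

$$\delta'\delta u^r = -\sum_{i,h,k} \Big[ \frac{\partial \{ih, r\}}{\partial x_k} - \sum_{l} \{lh, r\}\{ik, l\} \Big] u^i\, \delta x_h\, \delta' x_k - \sum_{i,h} \{ih, r\}\, u^i\, \delta'\delta x_h.$$

The expression for $\delta\delta' u^r$ can be obtained from this by interchanging
$\delta$ and $\delta'$; in the first sum we shall also interchange
h and k, giving

$$\delta\delta' u^r = -\sum_{i,h,k} \Big[ \frac{\partial \{ik, r\}}{\partial x_h} - \sum_{l} \{lk, r\}\{ih, l\} \Big] u^i\, \delta x_h\, \delta' x_k - \sum_{i,h} \{ih, r\}\, u^i\, \delta\delta' x_h.$$

In taking the difference $\delta'\delta u^r - \delta\delta' u^r$ the third sum cancels
out, because $\delta\delta' x_h = \delta'\delta x_h$ (cf. pp. 18, 19, § 4), and there remain
the terms involving the indices i, h, k, in which $u^i\, \delta x_h\, \delta' x_k$ can be
taken out as a common factor. If we introduce Riemann’s symbols
of the second kind,

$$\{ir, hk\} = \frac{\partial \{ih, r\}}{\partial x_k} - \frac{\partial \{ik, r\}}{\partial x_h} - \sum_{l=1}^{n} \big[ \{ik, l\}\{lh, r\} - \{ih, l\}\{lk, r\} \big] \quad (i, r, h, k = 1, 2, \ldots n), \tag{3}$$

we shall therefore have

$$\Delta u^r = (\delta'\delta - \delta\delta')\, u^r = -\sum_{i,h,k=1}^{n} \{ir, hk\}\, u^i\, \delta x_h\, \delta' x_k. \tag{4}$$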

1 

This formula shows that the required increment $\Delta u$ depends
on the vector u, on the two vectors $\delta P$, $\delta' P$ which define the
parallelogram, and lastly, on the metric of the manifold, through
the quantities $\{ir, hk\}$. From (4) it follows as a particular case
that for Euclidean spaces Riemann’s symbols as just defined are
all zero, whatever may be the co-ordinates x chosen for reference.
In fact, for such a manifold, we have $\Delta u^r = 0$ (r = 1, 2, ... n),
since any vector resumes its original value after parallel displacement
round any closed circuit whatever. Hence, the right-hand
side of (4) vanishes for every r, and for any value of the
vector u and of the displacements $\delta P$, $\delta' P$, i.e. for any values of
the arguments $u^i$, $\delta x_h$, $\delta' x_k$. The coefficients $\{ir, hk\}$ must
therefore vanish separately.

It will be useful to point out at once the following two properties
of the operator $\Delta$:

(a) When applied to a product, it behaves like a symbol of
ordinary differentiation, i.e.

$$\Delta(\psi\varphi) = \psi\, \Delta\varphi + \varphi\, \Delta\psi,$$

which can be verified directly, by calculating first $\delta(\psi\varphi)$, then
$\delta'\delta(\psi\varphi)$, &c.;

(b) when applied to a function of position, the result is zero,
as is obvious from the meaning of the symbol.

If instead of the increments $\Delta u^r$ we wish to find those of the
covariant components, we can use the relation

$$u_j = \sum_{r=1}^{n} a_{jr}\, u^r$$

and therefore, from the properties of the operator $\Delta$,

$$\Delta u_j = \sum_{r=1}^{n} a_{jr}\, \Delta u^r = -\sum_{r,i,h,k=1}^{n} a_{jr}\, \{ir, hk\}\, u^i\, \delta x_h\, \delta' x_k.$$

If we introduce Riemann’s symbols of the first kind,

$$(ij, hk) = \sum_{r=1}^{n} a_{jr}\, \{ir, hk\}, \tag{5}$$

we can sum with respect to r, and can then write

$$\Delta u_j = -\sum_{i,h,k=1}^{n} (ij, hk)\, u^i\, \delta x_h\, \delta' x_k, \tag{4'}$$

which is analogous to (4).

Solving (5) we get Riemann’s symbols of the second kind in
terms of those of the first kind by the formula (the inverse of
(5))

$$\{ir, hk\} = \sum_{j=1}^{n} a^{jr}\, (ij, hk). \tag{5'}$$
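
The vanishing of Riemann’s symbols for a Euclidean manifold, and their non-vanishing on a curved surface, can be checked numerically. The sketch below is only an illustration and makes several assumptions that are not in the text: it takes the sign convention {ir, hk} = d{ih, r}/dx_k - d{ik, r}/dx_h + sum over l of [{ih, l}{lk, r} - {ik, l}{lh, r}], which is the one consistent with the minus sign in (4); it forms (ij, hk) as the sum over r of a_jr {ir, hk}; it is restricted to n = 2; and it replaces exact derivatives by central differences with an arbitrarily chosen step H. All function names are hypothetical.

```python
import math

H = 1e-4  # finite-difference step (an assumption; adjust for other metrics)


def inverse2(a):
    """Inverse of a 2x2 matrix stored as nested lists."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]


def shifted(x, m, step):
    """Copy of the point x with co-ordinate m increased by step."""
    y = list(x)
    y[m] += step
    return y


def christoffel(metric, x):
    """Christoffel symbols of the second kind, G[j][l][i] = {jl, i}."""
    n = len(x)
    # da[m][i][k] approximates the derivative of a_ik with respect to x_m.
    da = []
    for m in range(n):
        ap, am = metric(shifted(x, m, H)), metric(shifted(x, m, -H))
        da.append([[(ap[i][k] - am[i][k]) / (2 * H) for k in range(n)]
                   for i in range(n)])
    ainv = inverse2(metric(x))
    # Symbols of the first kind, first[j][l][i] = [jl, i].
    first = [[[0.5 * (da[l][j][i] + da[j][l][i] - da[i][j][l])
               for i in range(n)] for l in range(n)] for j in range(n)]
    return [[[sum(ainv[i][m] * first[j][l][m] for m in range(n))
              for i in range(n)] for l in range(n)] for j in range(n)]


def riemann_second(metric, x):
    """R[i][r][h][k] = {ir, hk}, with the sign convention stated above."""
    n = len(x)
    G = christoffel(metric, x)
    # dG[m][j][l][i] approximates the derivative of {jl, i} with respect to x_m.
    dG = []
    for m in range(n):
        Gp = christoffel(metric, shifted(x, m, H))
        Gm = christoffel(metric, shifted(x, m, -H))
        dG.append([[[(Gp[j][l][i] - Gm[j][l][i]) / (2 * H) for i in range(n)]
                    for l in range(n)] for j in range(n)])
    R = [[[[0.0] * n for _ in range(n)] for _ in range(n)] for _ in range(n)]
    for i in range(n):
        for r in range(n):
            for h in range(n):
                for k in range(n):
                    value = dG[k][i][h][r] - dG[h][i][k][r]
                    for l in range(n):
                        value += G[i][h][l] * G[l][k][r] - G[i][k][l] * G[l][h][r]
                    R[i][r][h][k] = value
    return R


def riemann_first(metric, x):
    """S[i][j][h][k] = (ij, hk) = sum over r of a_jr {ir, hk}."""
    n = len(x)
    a, R = metric(x), riemann_second(metric, x)
    return [[[[sum(a[j][r] * R[i][r][h][k] for r in range(n))
               for k in range(n)] for h in range(n)] for j in range(n)]
            for i in range(n)]


def sphere(x):
    """Unit sphere, x = (colatitude, longitude)."""
    return [[1.0, 0.0], [0.0, math.sin(x[0]) ** 2]]


def polar(x):
    """Euclidean plane in polar co-ordinates, x = (radius, angle)."""
    return [[1.0, 0.0], [0.0, x[0] ** 2]]


print(riemann_first(sphere, [1.0, 0.3])[0][1][0][1])  # close to sin(1)**2
print(riemann_first(polar, [2.0, 0.5])[0][1][0][1])   # close to 0
```

For the plane in polar co-ordinates every symbol comes out zero to within the finite-difference error, although Christoffel’s symbols themselves do not vanish; for the unit sphere the symbol (12, 12) equals the determinant of the metric, in agreement with a Gaussian curvature of 1.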
3. Fundamental properties of Riemann’s symbols of the
second kind.

As can be seen from their definition (3), Riemann’s symbols
of the second kind are functions of position, depending on the
coefficients $a_{ik}$, their first derivatives (contained in Christoffel’s
symbols) and second derivatives (contained in the derivatives of
Christoffel’s symbols). They have the following fundamental
properties:

(a) They are antisymmetrical in the last two indices, i.e.

$$\{ir, hk\} = -\{ir, kh\}, \tag{6}$$

whence in particular

$$\{ir, hh\} = 0.$$

This property follows immediately from (3).

(b) They constitute a mixed tensor,¹ contravariant with respect
to the second index and covariant with respect to the other three,
so that the symbol $\{ir, hk\}$ could also be denoted (as is sometimes
done) by a tensor symbol such as $R^{r}{}_{ihk}$. To prove this, consider
the invariant

$$F = \sum_{r=1}^{n} u^r p_r,$$

where the p’s are given (but completely arbitrary) functions of
position, so that $\Delta p_r = 0$. If we give F a displacement round
an infinitesimal circuit, we find (remembering the behaviour of
the operator $\Delta$)

$$\Delta F = \sum_{r=1}^{n} (u^r\, \Delta p_r + p_r\, \Delta u^r) = \sum_{r=1}^{n} p_r\, \Delta u^r.$$

As F is invariant, this quantity must also be so; replacing
$\Delta u^r$ in it by its expression (4), we get the quadrilinear form

$$\Delta F = -\sum_{i,r,h,k=1}^{n} \{ir, hk\}\, p_r\, u^i\, \delta x_h\, \delta' x_k, \tag{7}$$

which expresses the required property of the Riemann’s symbols,
since the simple systems $p_r$, $u^i$, $\delta x_h$, $\delta' x_k$ are all arbitrary.

¹ Very generally, especially in works on the Theory of Relativity, called the
Riemann-Christoffel tensor.
We can use the tensor character of Riemann’s symbols to
obtain a second proof of the fact that Riemann’s symbols are
all zero for a Euclidean $V_n$ (the first proof is an immediate
consequence of (4), as was shown in the preceding section). In fact,
the definition (3) shows at once that they vanish in Cartesian
co-ordinates, and in consequence they vanish in any other system
of co-ordinates.

(c) They have an important cyclic property with respect to
the three indices of covariance, namely,

$$\{ir, hk\} + \{hr, ki\} + \{kr, ih\} = 0. \tag{8}$$

To prove this, we again take F and formula (7), but we suppose
that in them the p’s are derivatives of an invariant function f
of position (whose numerical values are otherwise arbitrary),
and we also take as vector u an infinitesimal displacement with
components $u^i = dx_i$, which is also arbitrary. With this choice
F becomes

$$F = \sum_{r=1}^{n} \frac{\partial f}{\partial x_r}\, dx_r = df,$$

and (7) becomes

$$(\delta'\delta - \delta\delta')\, df = -\sum_{i,r,h,k=1}^{n} \{ir, hk\}\, p_r\, dx_i\, \delta x_h\, \delta' x_k. \tag{9}$$

Interchanging cyclically the three infinitesimal vectors denoted
by the operators d, $\delta$, $\delta'$, we get the other two formulae:

$$(d\delta' - \delta' d)\, \delta f = -\sum_{i,r,h,k=1}^{n} \{ir, hk\}\, p_r\, \delta x_i\, \delta' x_h\, dx_k, \tag{9'}$$

$$(\delta d - d\delta)\, \delta' f = -\sum_{i,r,h,k=1}^{n} \{ir, hk\}\, p_r\, \delta' x_i\, dx_h\, \delta x_k. \tag{9''}$$

Now on the right-hand side of these last two formulae we can
arrange to have the product $dx_i\, \delta x_h\, \delta' x_k$ in the general term,
merely by a suitable interchange of the indices of summation.
We can then add (9), (9'), and (9''); the left-hand side gives 0,
since the terms cancel out in pairs (e.g. $\delta'\delta\, df - \delta' d\, \delta f = \delta'(\delta\, df - d\, \delta f) = 0$,
since f is a function of position); and we get

$$0 = \sum_{i,r,h,k=1}^{n} \big[ \{ir, hk\} + \{hr, ki\} + \{kr, ih\} \big]\, p_r\, dx_i\, \delta x_h\, \delta' x_k.$$

As $p_r$, $dx_i$, $\delta x_h$, $\delta' x_k$ are arbitrary, (8) follows at once.
4. Fundamental properties and number of Riemann’s symbols
of the first kind.

Riemann’s symbols of the first kind, as defined by (5) (i.e.
the quantities obtained by compounding the quadruple system
of the symbols of the second kind with the system of coefficients
$a_{jr}$) have the following properties:

(a) They are covariant with respect to all four indices, so that
they may be denoted, as is often done, by a four-index covariant
symbol such as $a_{ij,hk}$; this follows
from the definition, in consequence of the law of contraction.

The remark that a Euclidean $V_n$ has all its Riemann’s symbols
zero is true here too, whatever may be the system of reference.

(b) They are antisymmetrical with respect to each pair of indices,
so that we have identically

$$(ij, hk) = -(ij, kh), \tag{10}$$

$$(ij, hk) = -(ji, hk). \tag{11}$$

The identity (10) follows at once from (5), and from the
analogous property of the symbols of the second kind. To prove
(11) we shall follow a method analogous to that used in § 3 (b),
taking as invariant the scalar product of two arbitrary vectors
u, v,

$$F = \sum_{j=1}^{n} u^j v_j.$$

Applying the operator $\Delta$ and remembering that in a parallel
displacement the scalar product does not change (so that $\Delta F = 0$)
we get

$$0 = \sum_{j=1}^{n} v_j\, \Delta u^j + \sum_{j=1}^{n} u^j\, \Delta v_j. \tag{12}$$

The expression for $\Delta u^j$ is given by (4), by writing j in it instead
of r; that for $\Delta v_j$ is given by (4'). Substituting, we get

$$0 = \sum_{j,i,h,k} v_j\, \{ij, hk\}\, u^i\, \delta x_h\, \delta' x_k + \sum_{j,i,h,k} (ij, hk)\, u^j v^i\, \delta x_h\, \delta' x_k. \tag{12'}$$

In the first sum we express $v_j$ in terms of the contravariant
components $v^r$, and then, remembering (5), we sum with respect
to j; we get successively

$$\sum_{j,i,h,k,r} a_{jr}\, v^r\, \{ij, hk\}\, u^i\, \delta x_h\, \delta' x_k = \sum_{i,h,k,r} (ir, hk)\, u^i v^r\, \delta x_h\, \delta' x_k;$$

lastly, changing the indices i and r into j and i respectively (to
get the term in the same form as the second part of (12')), we
get

$$\sum_{j,i,h,k} (ji, hk)\, u^j v^i\, \delta x_h\, \delta' x_k.$$

We can now return to (12'), and taking out the common
factor $u^j v^i\, \delta x_h\, \delta' x_k$ we get

$$0 = \sum_{j,i,h,k} \big[ (ji, hk) + (ij, hk) \big]\, u^j v^i\, \delta x_h\, \delta' x_k; \tag{13}$$

from this, since $u^j$, $v^i$, $\delta x_h$, $\delta' x_k$ are arbitrary, we get formula (11).
(c) Riemann’s symbols of the first kind have also a cyclic
property analogous to that of the symbols of the second kind
and immediately deducible from the latter, namely,

$$(ij, hk) + (hj, ki) + (kj, ih) = 0, \tag{14}$$

where the second index remains fixed and the other three are
permuted cyclically. This formula follows directly from (8),
on multiplying by $a_{jr}$ and summing with respect to r.

As each of the terms in this sum is antisymmetrical, we can
at once obtain from (14) a similar identity

$$(ij, hk) + (ih, kj) + (ik, jh) = 0, \tag{14'}$$

in which the first index remains fixed and the other three are
permuted cyclically.

(d) Lastly, for the symbols of the first kind, there is a property
of permutability, which is a consequence of the foregoing
properties, and according to which we can interchange the two
pairs of indices without altering the value of the symbol; namely,

$$(ij, hk) = (hk, ij). \tag{15}$$

To prove this, take (14') and the three other identities obtained
from it by cyclic permutation of the four indices in the order
i, j, h, k, or

$$(ij, hk) + (ih, kj) + (ik, jh) = 0,$$
$$(jh, ki) + (jk, ih) + (ji, hk) = 0,$$
$$(hk, ij) + (hi, jk) + (hj, ki) = 0,$$
$$(ki, jh) + (kj, hi) + (kh, ij) = 0.$$

Adding the first and fourth and subtracting the second and
third of these identities, and using the property of antisymmetry,
we see that the terms cancel out in pairs, except the four terms
(ij, hk), (ji, hk), (hk, ij), (kh, ij), which give

$$2(ij, hk) - 2(hk, ij) = 0,$$

whence the required property follows.

We shall now calculate the number of independent Rieymnn^s 
symbols of the first kind, A quadruple system (cf. § 2, p. 65) 
has in general n^ elements, if there are n independent variables^ 
The number of distinct Riemann’s symbols of each kind is 
smaller, however, as these symbols are connected by the identities 
we have just proved. Wo shall determine this number for the 
symbols of the first kind, dividing them into three classes, and 
counting separately those in each class, as follows: 

(1) Symbols with only two different indices: these are of the 
type (ij, ij), since the other possible arrangements are either 
reducible to this, or give zero values. Each pair of unequal 
indices i, j therefore gives a single symbol of this class, which 
thus contains 

n(n - 1)/2 

symbols. 

(2) Symbols with three different indices: these are of the type 
(ij, ih), since here too the other possible arrangements are reducible 
to this or give zero values. Every triplet of unequal indices will 
give three symbols of this type (since the repeated index may be 
any one of the three); since there are 

n(n - 1)(n - 2) / (1 · 2 · 3) 

triplets, the number of distinct symbols of the class we are considering 
amounts to 

n(n - 1)(n - 2)/2. 

(3) Symbols with four different indices: a set of four different 
indices i, j, h, k will give the three symbols 

(ij, hk), (ih, kj), (ik, jh), 

while every other possible arrangement gives a symbol reducible 
to one of these. But these three are not independent, on account 
of the cyclic relation (14'). It follows that each of the 

n(n - 1)(n - 2)(n - 3) / (1 · 2 · 3 · 4) 

sets of four indices gives two distinct symbols, so that the total 
number of these is 

n(n - 1)(n - 2)(n - 3)/12. 

Adding these three partial totals, and simplifying, we get 
the total number N of independent Riemann's symbols of the 
first kind: 

$$N = \frac{n^2 (n^2 - 1)}{12}.$$

Thus for an ordinary surface (n = 2) we have N = 1; 
for a three-dimensional space, N = 6; for a four-dimensional 
space, N = 20. 
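The three partial totals and their sum can be verified with a few lines of arithmetic. This is a minimal sketch; the function names are illustrative and not part of the text.

```python
from math import comb


def class_counts(n):
    """Numbers of distinct symbols in classes (1), (2) and (3)."""
    two_indices = comb(n, 2)        # one symbol (ij, ij) per pair of indices
    three_indices = 3 * comb(n, 3)  # three symbols (ij, ih) per triplet
    four_indices = 2 * comb(n, 4)   # two independent symbols per set of four
    return two_indices, three_indices, four_indices


def independent_symbols(n):
    """Closed form N = n^2 (n^2 - 1) / 12."""
    return n * n * (n * n - 1) // 12


for n in range(2, 7):
    counts = class_counts(n)
    # The class totals must add up to the closed form for every n.
    assert sum(counts) == independent_symbols(n)
    print(n, counts, independent_symbols(n))
```

For n = 2, 3, 4 the closed form gives 1, 6 and 20, in agreement with the values quoted above.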


5. Bianchi's identities.* 


Bianchi’s identities are cyclic relations between the covariant 
derivatives of Riemann’s symbols of both the first and second 
kinds. They are obtained as follows: 

Take formula (3), which defines the symbol of the second 
kind {ir, hk}, and differentiate it with respect to $x_l$. We note, 
however, that on differentiation the last part, which consists of 
terms of the second order in Christoffel's symbols, gives terms 
made up of the product of one such symbol by the derivative of 
another: the essential point for us is that, with reference to a 
specified point P, by choosing co-ordinates which are geodesic 
at that point, we can make all these terms vanish. We cannot, 
however, treat the first part in the same way, as the geodesic 
co-ordinates make Christoffel's symbols vanish but not their 
derivatives. We shall therefore have the formula, valid at the 
point P for co-ordinates geodesic at P, 




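$$\frac{\partial}{\partial x_l}\{ir, hk\} = \frac{\partial^2 \{ih, r\}}{\partial x_k\,\partial x_l} - \frac{\partial^2 \{ik, r\}}{\partial x_h\,\partial x_l}.$$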


* These identities were stated without proof by Padova, on the strength of 
a verbal communication of Ricci (cf. "Sulle deformazioni infinitesime", in Rend. 
della R. Acc. dei Lincei, (4), Vol. V (first half-year, 1889), p. 176). They were then 
forgotten even by Ricci himself. Bianchi rediscovered them and published a 
proof obtained by direct calculation in 1902 (Ibid., (5), Vol. XI (first half-year, 
1902), pp. 3-7). 
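Write down also the two other formulae obtained from this 
by cyclic permutation of the indices h, k, l, leaving i and r fixed: 

$$\frac{\partial}{\partial x_h}\{ir, kl\} = \frac{\partial^2 \{ik, r\}}{\partial x_l\,\partial x_h} - \frac{\partial^2 \{il, r\}}{\partial x_k\,\partial x_h},$$

$$\frac{\partial}{\partial x_k}\{ir, lh\} = \frac{\partial^2 \{il, r\}}{\partial x_h\,\partial x_k} - \frac{\partial^2 \{ih, r\}}{\partial x_l\,\partial x_k}.$$

Adding the terms on the left and right of these three equations 
we get 

$$\frac{\partial}{\partial x_l}\{ir, hk\} + \frac{\partial}{\partial x_h}\{ir, kl\} + \frac{\partial}{\partial x_k}\{ir, lh\} = 0, \qquad (16)$$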


which holds at the point P, in the particular system of co-ordinates 
chosen. Now consider the following mixed tensor of rank five 

$$\{ir, hk\}_l + \{ir, kl\}_h + \{ir, lh\}_k,$$

in which the suffixes outside the brackets denote covariant differ- 
entiation. This system, referred to the point P and to co- 
ordinates geodesic at P, is identical with the left-hand side of 
(16), since in these conditions the covariant and ordinary 
derivatives are identical; all its elements are therefore equal to 
zero, and it will therefore be identically zero whatever may be 
the system of reference (cf. Chapter IV, p. 84). We have thus 
proved the identity 

$$\{ir, hk\}_l + \{ir, kl\}_h + \{ir, lh\}_k = 0, \qquad (17)$$

which is a first form of the result established by Bianchi. 

For the symbols of the first kind the analogous relation is 
easily proved from the definition of these symbols given by (6). 
In fact, taking the covariant derivative of this formula, and 
using Ricci’s lemma, we find that 

$$(ij, hk)_l = \sum_{r=1}^{n} a_{jr}\,\{ir, hk\}_l.$$

From this, permuting cyclically the indices h, k, I and sum- 
ming, we get, by (17), 

$$(ij, hk)_l + (ij, kl)_h + (ij, lh)_k = 0, \qquad (17')$$

which is the second form of Bianchi’s identity. 
6. Commutation rule for the second covariant derivatives. 

An important application of Riemann's symbols occurs in 
the formula which gives the relation between the two systems 

$$A^{(j)}_{(i)|hk}, \qquad A^{(j)}_{(i)|kh}$$

obtained by double covariant differentiation from a generic 
tensor $A^{(j)}_{(i)}$, where (i) stands for the aggregate of indices of 
covariance $i_1 \ldots i_m$ and (j) for the aggregate of indices of 
contravariance $j_1 \ldots j_\mu$. To simplify the formulae we shall consider a 
mixed double system $A_i^j$, with the remark that the procedure 
is similar if there are more indices. 

We start as usual from the bilinear form 

$$F = \sum_{ij} A_i^j\,\xi^i\,u_j,$$

the invariance of which determines the law of transformation 
of the A's; the $\xi^i$'s are the contravariant components of an 
arbitrary vector ξ and the $u_j$'s are the covariant components 
of another arbitrary vector u. The procedure will consist in 
calculating, in two different ways, the quantity ΔF corresponding 
to a cyclic displacement round an elementary parallelogram 
(cf. § 2), and in equating the two expressions so 
obtained. 

A first way of calculating ΔF is as follows. We associate the 
increments δ, δ' with two sides of the parallelogram, as in § 2, 
and use (1). Note first that from the definition of the covariant 
derivative (cf. Chapter VI, p. 146) δF is given by 

$$\delta F = \sum_{ijh} A^j_{i|h}\,\xi^i\,u_j\,\delta x_h.$$

Similarly, applying the symbol δ' to this expression, we 
shall get 

$$\delta'\delta F = \sum_{ijhk} A^j_{i|hk}\,\xi^i\,u_j\,\delta x_h\,\delta' x_k.$$

From this, interchanging δ and δ', we get 

$$\delta\delta' F = \sum_{ijhk} A^j_{i|hk}\,\xi^i\,u_j\,\delta' x_h\,\delta x_k;$$

subtracting these two equations, after interchanging the indices 
h and k in the second, we get 

$$\Delta F = \sum_{ijhk} \bigl(A^j_{i|hk} - A^j_{i|kh}\bigr)\,\xi^i\,u_j\,\delta x_h\,\delta' x_k. \qquad (18)$$

The other method of calculating this quantity consists in 
applying the operator Δ directly to the expression for F. Remembering 
the fundamental properties of Δ (§ 2) we shall get 

$$\Delta F = \sum_{ij} A_i^j\,\bigl(u_j\,\Delta\xi^i + \xi^i\,\Delta u_j\bigr),$$

or, substituting for $\Delta\xi^i$ and $\Delta u_j$ the expressions given by (4) 
and (4'), 

$$\Delta F = -\sum_{ijlhk} A_i^j\,\{li, hk\}\,u_j\,\xi^l\,\delta x_h\,\delta' x_k - \sum_{ijphk} A_i^j\,(pj, hk)\,u^p\,\xi^i\,\delta x_h\,\delta' x_k.$$

In order to get the second sum in the same form as the first, 
we shall express it in terms of symbols of the second kind and 
of the covariant components of u; to do this we first use the 
property of antisymmetry with respect to the first pair of indices, 
and express the $u^p$ in terms of the $u_l$'s, so that the sum becomes 

$$+\sum_{ijplhk} A_i^j\,a^{pl}\,(jp, hk)\,u_l\,\xi^i\,\delta x_h\,\delta' x_k.$$

Summing with respect to p, and using (5'), we get 

$$\sum_{ijlhk} A_i^j\,\{jl, hk\}\,u_l\,\xi^i\,\delta x_h\,\delta' x_k.$$

Now, in order to reconstruct the second expression for ΔF 
in a suitable form, we shall first interchange some indices, so 
as to be able to take out the factor $u_j\,\xi^i\,\delta x_h\,\delta' x_k$ which occurs 
in (18). We must interchange l and i in the first sum, and l and 
j in the (modified) second; we shall then get, collecting both sums 
under a single summation sign, 

$$\Delta F = -\sum_{ijhk} u_j\,\xi^i\,\delta x_h\,\delta' x_k \sum_{l} \bigl[A_l^j\,\{il, hk\} - A_i^l\,\{lj, hk\}\bigr].$$
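Comparing this expression with (18), and remembering that 
ξ, u, the δx's, and the δ'x's are arbitrary, we get the commutation 
formula 

$$A^j_{i|hk} - A^j_{i|kh} = -\sum_{l} \bigl[A_l^j\,\{il, hk\} - A_i^l\,\{lj, hk\}\bigr]. \qquad (19)$$

If the system from which we start has m indices of covariance 
and μ of contravariance, we must consider m vectors ξ, determined 
by their contravariant components, and μ vectors u, determined 
by their covariant components; by an analogous process 
we shall find 

$$A^{(j)}_{(i)|hk} - A^{(j)}_{(i)|kh} = -\sum_{l=1}^{n} \Bigl[\sum_{\rho=1}^{m} \{i_\rho l, hk\}\,A^{(j)}_{i_1 \ldots i_{\rho-1}\,l\,i_{\rho+1} \ldots i_m} - \sum_{\sigma=1}^{\mu} \{l j_\sigma, hk\}\,A^{j_1 \ldots j_{\sigma-1}\,l\,j_{\sigma+1} \ldots j_\mu}_{(i)}\Bigr]. \qquad (20)$$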

7. Cyclic displacement round any infinitesimal circuit. 

We now return to the order of ideas interrupted at § 2. Given 
a direction u at a point P, we shall give u a parallel displacement 
(in $V_n$) round a closed curve T, infinitesimal, but of any shape 
whatever, and of course passing through P; we propose to 
calculate the change $Du^r$ in a generic parameter $u^r$ of u caused 
by the cyclic displacement. The formula we shall find will be 
merely a generalization of (4), and must reduce to (4) 
if as a particular case we take for T an infinitesimal parallelogram. 

For an elementary displacement $dx_h$ we know that the change 
in $u^r$ is 

$$du^r = -\sum_{ih} \{ih, r\}\,u^i\,dx_h,$$

so that it has the form of a Pfaffian 

$$\psi = \sum_{h} X_h\,dx_h, \qquad (21)$$

in which the $X_h$ are functions of position (since they contain 
Christoffel's symbols) and of the $u^i$, which are defined along 
the curve T by the equations of parallelism $du^r = \psi$. We have 
to calculate the integral 

$$Du^r = \oint_T \psi = \oint_T \sum_{h} X_h\,dx_h, \qquad (22)$$

where we use the operator D to indicate the increment resulting 
from displacement right round the circuit. We shall now consider 
any surface σ containing the curve T, and we shall call 
Γ the region of this surface which is within T, and is such that 
T constitutes its complete boundary. We propose to transform 
the integral round the circuit T, which occurs in (22), into an 
integral over the surface Γ. To do this, we shall first introduce 
a system of co-ordinates $q_1$ and $q_2$ on the surface in question, 
defining them by the parametric equations 

$$x_i = x_i(q_1, q_2) \qquad (i = 1, 2, \ldots, n).$$

The dx's will consequently be linear functions of $dq_1$ and 
$dq_2$; substituting their expressions in the Pfaffian (21), this will 
take the form 

$$du^r = Q_1\,dq_1 + Q_2\,dq_2, \qquad (21')$$

where the quantities $Q_1$ and $Q_2$, like the X's, are defined along 
the curve T. The integral to be calculated will thus be 

$$Du^r = \oint_T (Q_1\,dq_1 + Q_2\,dq_2). \qquad (22')$$

We shall suppose the curvilinear co-ordinates $q_1$, $q_2$ so chosen 
that the sense of integration round T is the same as that determined 
on Γ (at a generic point) by the rotation (through the 
angle less than 180°) from the positive direction of the line $q_1$ 
(i.e. in the sense of $q_1$ increasing) to the positive direction of 
the line $q_2$. 

The transformation of the line integral (22') into a surface 
integral taken over Γ could be effected at once if the Q's were 
defined as functions of position in the interior of Γ as well as on 
its boundary. But instead of this they contain the $u^i$'s, which 
are given at P, and at points on T have values resulting from 
the parallel displacement along T itself, but are not defined for 
a point M within Γ, their values at M depending on the path 
followed in the parallel displacement of u from P to M. We 
shall, however, show that if Γ is infinitesimal the influence of the 
path of displacement on the values of the $u^i$'s at M is negligible, 
and therefore we may consider the u's, and in consequence the 
X's or the Q's, as functions of position over the whole area Γ, 
which will enable us to make the required transformation of 
(22').* 

We shall make some preliminary remarks on orders of magnitude, 
using for this purpose the general existence theorem for 
integrals of systems of ordinary differential equations. Such a 
system is constituted by the equations 

$$du^r = \psi \qquad (r = 1, 2, \ldots, n),$$

which define the functions $u^r$ along a generic line T, starting 
from a given set of initial values at P (cf. Chapter II, 
p. 23). 

Now the existence theorem assures us that in general (i.e. 
when certain not very restrictive conditions of continuity 
and differentiability are satisfied) the initial values define the 
integrals uniquely, and these integrals and their derivatives 
are continuous functions for values of the independent variable 
within an interval which is not shorter than some assignable 
quantity. 

In our case, granting, what will naturally be the case, that 
the coefficients $a_{ik}$ of $ds^2$ and the reciprocals $a^{ik}$ are finite and 
continuous, as well as their first and second derivatives, in a 
certain region round P, and supposing also that the length of 
the vector u at P is limited, i.e. is not greater than some specified, 
but arbitrary, constant U, it can easily be deduced from the 
above-mentioned existence theorem that (considering the arc 
of the curve of displacement as the independent variable) the 
$u^r$ are defined (as continuous, differentiable, &c., functions), with 
P as starting-point, along any curve whatever, for a segment 
of the curve of length not greater than a certain quantity A 
which depends exclusively on the metric of the manifold and on 
U. Hence it follows in particular that the differences between 
the $u^r$'s and their initial values are of the same order¹ of magnitude 
as the length L of the arc along which the integration of 
the system $du^r = \psi$ is effected. 

* We shall in fact here limit the discussion to an indication of the general lines 
of the argument, without pausing over the details needed to justify the various 
steps of the proof with complete rigour. There is an exhaustive proof in the 
article by H. Tietze: "Ueber Parallelverschiebung in Riemann'schen Räumen", 
in Math. Zeitschrift, Vol. 16, 1923, pp. 308-317. 

Further, the area Γ which we are considering on the surface 
σ is infinitesimal, in the sense that we propose to make it diminish 
indefinitely. It is therefore perfectly legitimate to suppose that 
it is already so small that every point in it can be reached from 
P by a line of length not greater than A and that the length of 
the whole contour T is also less than A. 

We thus have the result that if u undergoes a parallel displacement 
from P to a point M within the area Γ, or on its boundary 
T, the $u^r$'s at M differ from their values at P by quantities 
of the same order as L, if L (not greater than A) is the maximum 
length of the lines considered. As a first approximation, in which 
quantities of the same order as L are neglected, we can therefore 
take the components $u^r$ as constant (equal to their values at P) 
over the whole area Γ, including its boundary. 

We can find a closer approximation if we calculate the $u^r$ 
at M by integrating the Pfaffian (21) along a curve PM, substituting, 
however, for the coefficients $X_h$ their values at P. 
This process involves an error of order L in the values of these 
coefficients, and therefore an error of order $L^2$ in the values 
found for the $u^r$'s at M; the choice of the curve PM is indifferent, 
as in this case the Pfaffian becomes an exact differential. As 
the coefficients are constants, the integration can be at once 
effected, and will give linear functions of the x's for the $u^r$'s. 
We shall thus have obtained the $u^r$ as functions of position at 
all points (including the boundary) of the area Γ, neglecting 
quantities of order $L^2$. 

We can get a third approximation by substituting these 
approximate linear expressions of the u's in the coefficients 
$X_h$. The X's will thus become functions of position, as was 
required, defined throughout Γ, including its boundary T, and 
coinciding on T (if we neglect $L^2$) with their accurate values 
as already defined. These values of the X's can be used to 
calculate the integral (22), which will give the value of $Du^r$, 
the error being now of order $L^3$. It was necessary to carry 
the approximation thus far, since the other two approximations 
make ψ an exact differential, and would give the value zero for 
$Du^r$, the obvious meaning of this being that $Du^r$ is a quantity 
of order $L^2$. 

¹ This means that the differences in question are not greater than the product 
of L by a certain finite coefficient, which does not depend on L or on the curve of 
integration, but only on U, and the metric of the manifold. 

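We shall therefore return to (22), giving the X's the meaning 
just explained, and transform it into the form (22') by using the 
parametric equations of the surface σ; $Q_1$ and $Q_2$ will now 
represent functions of position defined at all points of Γ. In 
consequence¹ we can transform the line integral into a surface 
integral, getting 

$$Du^r = \iint_\Gamma \Bigl(\frac{\partial Q_2}{\partial q_1} - \frac{\partial Q_1}{\partial q_2}\Bigr)\,dq_1\,dq_2,$$

or also 

$$Du^r = \iint_\Gamma \Bigl[\frac{\partial (Q_2\,dq_2)}{\partial q_1}\,dq_1 - \frac{\partial (Q_1\,dq_1)}{\partial q_2}\,dq_2\Bigr]. \qquad (23)$$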

We must now find the value of the integrand on the right of 
(23), which can be done by means of the following considerations, 
without writing out the expressions for $Q_1$ and $Q_2$ at full 
length. 

Let the operator δ denote the increment of a quantity corresponding 
to a displacement along the line $q_2$ = constant, when 
$q_1$ increases by $dq_1$; similarly, δ' corresponds to an increase $dq_2$ 
in $q_2$ alone. We shall therefore have 

$$\delta x_i = \frac{\partial x_i}{\partial q_1}\,dq_1, \qquad \delta' x_i = \frac{\partial x_i}{\partial q_2}\,dq_2, \qquad (24)$$

¹ By the ordinary formulae 

$$\oint_T f\,dq_1 = -\iint_\Gamma \frac{\partial f}{\partial q_2}\,dq_1\,dq_2, \qquad \oint_T f\,dq_2 = \iint_\Gamma \frac{\partial f}{\partial q_1}\,dq_1\,dq_2,$$

where f is a function of $q_1$ and $q_2$ which is continuous, together with its first 
derivatives. Usually in these formulae $q_1$, $q_2$ are interpreted as Cartesian 
co-ordinates in a plane, and the sense in which the curve T is described is defined 
by the condition that the pair of directions s, n (s the tangent to the boundary in 
the sense in which it is described, n the normal to T drawn inwards) is congruent 
in the plane with the pair $q_1$, $q_2$. The formulae obviously hold, however, independently 
of this interpretation in plane geometry, and can therefore be applied even 
if $q_1$, $q_2$ are any curvilinear co-ordinates whatever, the sense of describing the curve 
being determined by an analogous criterion to that just explained, provided that 
we introduce, at a generic point of the boundary curve, the directions tangential 
to $q_1$ (i.e. $q_2$ = constant) and $q_2$, in the sense in which the respective parameters 
increase. Now we have already supposed (p. 187) that the auxiliary curvilinear 
co-ordinates $q_1$, $q_2$ behave as regards sense in precisely this way. Hence these 
formulae hold both in magnitude and in sign. 


and analogous expressions for any function of position. Now 
note that in (21') the first term represents precisely the increment 
of $u^r$ due to a displacement of the first type ($dq_2 = 0$), 
so that 

$$\delta u^r = Q_1\,dq_1,$$

and, the second term having an analogous meaning, 

$$\delta' u^r = Q_2\,dq_2.$$

We can therefore write (23) in the form 

$$Du^r = \iint_\Gamma \bigl[\delta\delta' u^r - \delta'\delta u^r\bigr];$$

and from this, remembering equation (4), we get 

$$Du^r = \iint_\Gamma \sum_{ihk} \{ir, hk\}\,u^i\,\delta x_h\,\delta' x_k.$$

J r I 

Now let us put $\xi^i$, $\eta^i$ for the parameters (in $V_n$) of the lines 
$q_2$ = constant, $q_1$ = constant, i.e. 

$$\xi^i = \frac{\delta x_i}{\delta s}, \qquad \eta^i = \frac{\delta' x_i}{\delta' s},$$

where δs, δ's are the lengths of the displacements, along the 
lines $q_2$ = constant, $q_1$ = constant, whose components are $\delta x_i$, 
$\delta' x_i$. Then we can also write 

$$Du^r = \iint_\Gamma f_r\,\delta s\,\delta' s, \qquad (25)$$

where for shortness we have put 

$$f_r = \sum_{ihk} \{ir, hk\}\,u^i\,\xi^h\,\eta^k.$$

We can now remark that if ϑ is the angle between the co-ordinate 
lines $q_1$, $q_2$, the element of area (cf. Chapter V, p. 99) is 

$$d\Gamma = \delta s\,\delta' s\,\sin\vartheta,$$

and therefore (25) becomes 

$$Du^r = \iint_\Gamma \frac{f_r}{\sin\vartheta}\,d\Gamma.$$
By the mean value theorem, if $\bigl[f_r/\sin\vartheta\bigr]_{M_0}$ denotes the value 
of the function of position $f_r/\sin\vartheta$ at a suitable point $M_0$ (not known 
a priori) within Γ, and DΓ the area of the region Γ, we can also 
write 

$$Du^r = \Bigl[\frac{f_r}{\sin\vartheta}\Bigr]_{M_0} D\Gamma.$$

Now the value at $M_0$ of the function of position $f_r/\sin\vartheta$ differs 
from its value at P by quantities of order L (since the distance 
$PM_0$ is of this order); since the area DΓ is of order $L^2$, the error 
caused by substituting the value at P for that at $M_0$ is of order 
$L^3$, which we have agreed to neglect. We shall therefore have 

$$Du^r = \frac{f_r}{\sin\vartheta}\,D\Gamma,$$

or 

$$Du^r = \frac{D\Gamma}{\sin\vartheta} \sum_{ihk} \{ir, hk\}\,u^i\,\xi^h\,\eta^k. \qquad (26)$$


In this formula the quantities $\xi^h$ and $\eta^k$ represent the parameters 
of the lines $q_2$ = constant, $q_1$ = constant, and ϑ the angle between these 
lines; the values of the $u^i$ and of the Riemann's symbols refer 
to the point P. It will be seen that the influence of the circuit 
of displacement appears in this formula in three geometrical 
elements, which serve substantially to determine the circuit 
itself, namely: two directions ξ, η (a priori any whatever) which 
determine the section on which the circuit is supposed drawn, 
together with the angle between them; and the area DΓ of the 
circuit itself (measured according to the metric of $V_n$). 

8. Pérès's formula. 

From (26) we immediately get the fundamental formula which 
serves as a link between parallelism and curvature. 

Take any (fourth) direction v drawn from P. Let α be the 
angle between v and u, and α + Dα the angle between v and 
u + Du; we propose to calculate Dα. To do this we take the 
scalar product 

u × v = cos α 

(assuming that u, like v, is a unit vector), and differentiate with 
the symbol D, remembering that Dv = 0 since v is a fixed 
vector. We get 

v × Du = -sin α Dα, 

or, substituting for the scalar product on the left its expression 
$\sum_{r} v_r\,Du^r$, and using (26), 

$$\sin\alpha\,D\alpha = -\frac{D\Gamma}{\sin\vartheta} \sum_{irhk} \{ir, hk\}\,u^i\,v_r\,\xi^h\,\eta^k.$$

If in this formula we express the $v_r$ in terms of the $v^j$ and 
sum with respect to r (remembering (5)), we get 

$$\sin\alpha\,D\alpha = -\frac{D\Gamma}{\sin\vartheta} \sum_{ijhk} (ij, hk)\,u^i\,v^j\,\xi^h\,\eta^k.$$

Now if the directions u, v coincide or are opposite, this formula 
reduces to an identity of no interest, since on the left we have 
sin α = 0, and the right-hand side vanishes from the antisymmetry 
of the Riemann's symbols. Excluding this case, we can 
divide the whole equation by sin α, and we get Pérès's formula 

$$D\alpha = -\frac{D\Gamma}{\sin\alpha\,\sin\vartheta} \sum_{ijhk} (ij, hk)\,u^i\,v^j\,\xi^h\,\eta^k. \qquad (27)$$

9. Application to surfaces. Gaussian curvature of a $V_2$. 

In considering the particular case of a $V_2$, i.e. an ordinary 
surface, the directions u, v must of course be contained in the 
section (the only one there is) defined by ξ, η. Since the only 
purpose of these last two vectors is to specify the section on which 
the circuit is drawn, we can make them coincide with u and v 
without loss of generality; (27) will then become 

$$D\alpha = -\frac{D\Gamma}{\sin^2\alpha} \sum_{ijhk} (ij, hk)\,u^i\,v^j\,u^h\,v^k. \qquad (27')$$

Since for n = 2 Riemann's symbols which do not vanish 
are represented by the single arrangement of indices (12, 12), 
this formula can be further reduced to 

$$D\alpha = -\frac{D\Gamma}{\sin^2\alpha}\,(12, 12)\,(u^1 v^2 - u^2 v^1)^2,$$

or finally, remembering the expression for sin α in terms of the 
parameters of the two directions u, v (cf. Chap. V, p. 94, formula 
(i)'), we get 

$$D\alpha = -\frac{D\Gamma}{a}\,(12, 12).$$

It is usual to put 

$$K = \frac{(12, 12)}{a}, \qquad (28)$$

so that the foregoing formula becomes 

$$\frac{D\alpha}{D\Gamma} = -K. \qquad (29)$$
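The reduction from (27') to (28) asserts that, for n = 2, the ratio on the right of (27') does not depend on the choice of u and v. The sketch below checks this numerically; the metric and the value of (12, 12) are arbitrary illustrative numbers, and the function name is an assumption rather than anything defined in the text.

```python
import math


def curvature_from_directions(metric, r1212, u, v):
    """Evaluate (1/sin^2 a) * sum (ij, hk) u^i v^j u^h v^k for n = 2."""
    def dot(x, y):
        return sum(metric[i][j] * x[i] * y[j] for i in range(2) for j in range(2))

    # Normalise u and v to unit length with respect to the metric.
    u = [c / math.sqrt(dot(u, u)) for c in u]
    v = [c / math.sqrt(dot(v, v)) for c in v]
    sin2 = 1.0 - dot(u, v) ** 2

    # For n = 2 every symbol is +(12, 12), -(12, 12) or zero.
    e = [[0, 1], [-1, 0]]
    total = sum(r1212 * e[i][j] * e[h][k] * u[i] * v[j] * u[h] * v[k]
                for i in range(2) for j in range(2)
                for h in range(2) for k in range(2))
    return total / sin2


metric = [[2.0, 0.5], [0.5, 1.0]]   # illustrative coefficients a_ik
a = metric[0][0] * metric[1][1] - metric[0][1] ** 2
r1212 = 3.5                         # illustrative value of (12, 12)

print(curvature_from_directions(metric, r1212, [1.0, 0.0], [0.0, 1.0]), r1212 / a)
print(curvature_from_directions(metric, r1212, [1.0, 2.0], [-3.0, 1.0]), r1212 / a)
```

Both pairs of directions give the same value, equal to (12, 12)/a, which is the content of (28).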


From this it will be seen that the function of position K 
defined by (28) is an invariant; it depends on the coefficients 
$a_{ik}$ and their first and second derivatives, and is identical with 
the quantity which in the theory of surfaces is known as the total, 
or Gaussian, curvature (the product of the curvatures of the principal 
sections).¹ The equation (29) can be put in a more instructive 
form if we introduce the angle of parallelism ε, i.e. the angle 
between u and u + Du (or between u and its parallel obtained 
by displacement round the circuit), measured in the sense in 
which the circuit is described. We can also say that ε is the angle 
through which u has been rotated as a result of the cyclic displacement. 
It is obvious that ε has the same absolute value as 
Dα, but we shall see that as regards sign the precise relation is 

ε = -Dα. 

To show this we need only remember the convention adopted 
above (§ 7) that the circuit is to be described in the positive sense 
with respect to the co-ordinate lines $q_1$, $q_2$, or from the versor ξ 
to the versor η. As ξ and η now coincide with u and v the sense 
in which the circuit is described is from u to v (through the convex 
angle). Accordingly (see fig. 3) if Dα > 0, u + Du is outside the 
convex angle uv, and is therefore reached from u by moving in the 
negative sense (ε < 0), and vice versa. 

Fig. 3 

We can therefore write (29) in the form 

$$K = \frac{\epsilon}{D\Gamma}, \qquad (29')$$

which gives the following important interpretation of the curvature 
K: K is the ratio of the angle of parallelism (taken with its 
proper sign in relation to the sense of describing the circuit) 
to the area of the circuit. 

¹ See also below, Chapter IX, p. 261. 
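The interpretation (29') can be tried out on the unit sphere, where the curvature is constant and the ratio is therefore exact even for a finite circuit. The sketch below is an illustration under stated assumptions: co-ordinates θ (colatitude) and φ (longitude), metric $ds^2 = d\theta^2 + \sin^2\theta\,d\varphi^2$, the circuit θ = θ0 described with φ increasing, and a Runge-Kutta integration of the equations of parallelism. The function names are illustrative.

```python
import math


def transport_round_parallel(theta0, steps=2000):
    """Parallel-transport u = (1, 0) once round the circle theta = theta0."""
    sin_t, cos_t = math.sin(theta0), math.cos(theta0)

    def rate(u):
        # du^r/dphi = -sum_i {i phi, r} u^i, with the Christoffel symbols
        # {phi phi, theta} = -sin*cos and {theta phi, phi} = cos/sin.
        u_theta, u_phi = u
        return (sin_t * cos_t * u_phi, -(cos_t / sin_t) * u_theta)

    def shifted(u, k, h):
        return (u[0] + h * k[0], u[1] + h * k[1])

    u = (1.0, 0.0)
    h = 2.0 * math.pi / steps
    for _ in range(steps):  # classical fourth-order Runge-Kutta step
        k1 = rate(u)
        k2 = rate(shifted(u, k1, h / 2))
        k3 = rate(shifted(u, k2, h / 2))
        k4 = rate(shifted(u, k3, h))
        u = (u[0] + h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6,
             u[1] + h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6)
    return u


def curvature_estimate(theta0):
    """Angle of parallelism divided by the area of the enclosed polar cap."""
    u_theta, u_phi = transport_round_parallel(theta0)
    # Components along the unit vectors of the theta and phi lines; the angle
    # is measured from the theta direction towards the phi direction.
    angle = math.atan2(math.sin(theta0) * u_phi, u_theta) % (2.0 * math.pi)
    area = 2.0 * math.pi * (1.0 - math.cos(theta0))
    return angle / area


print(curvature_estimate(0.3))
```

The estimate should come out close to 1, the curvature of the unit sphere. The angle is only determined up to multiples of 2π, so the sketch is meaningful for caps of area less than 2π (θ0 below a right angle).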

In the case of a Euclidean $V_2$ Riemann's symbol is zero 
(cf. § 4) and therefore K = 0. This can also be deduced from 
the geometrically obvious fact (already used in § 2) that the 
parallel displacement is integrable (i.e. that the result does not 
depend on the curve of displacement). 


10. Riemannian curvature of a $V_n$. 

If instead of a surface we consider any $V_n$ whatever, the 
notion of curvature becomes less simple. If P is a fixed point 
of the $V_n$, then with every section through P determined by two 
arbitrary directions ξ, η drawn from P we can associate an 
invariant K, which is called the Riemannian curvature of the 
$V_n$ at P with respect to the section considered. Following Riemann, 
we construct the geodesic surface determined by the 
point P and the two directions ξ, η; we then take the Gaussian 
curvature K of this geodesic surface as the curvature of the 
$V_n$ at the point and in the section in question. In general the 
Riemannian curvature differs in the different sections. 

The foregoing considerations enable us to give another impor- 
tant definition of the Riemannian curvature, and to find the 
analytical expression for it. 

Given the elements P, ξ, η, construct the geodesic surface 
g defined by them and consider an infinitesimal circuit on g 
passing through P, of area DΓ. Take one of the given directions, 
say ξ, and give it a parallel displacement with respect to the 
surface g round the circuit, in the sense ξ → η. Now calculate 
by Pérès's formula the change Dα in the angle between ξ and η, 
i.e. the difference between its values before and after the displacement. 
The curvature K will then be given by (29). Now 
from Severi's theorem that an infinitesimal parallel displacement 
with respect to the surface g (using the metric of g) can be replaced 
by the analogous infinitesimal parallel displacement in $V_n$, it 
follows at once that this method of calculating K does not really 
involve the use of the geodesic surface g. Hence the Riemannian 
curvature K can be defined as the ratio (with sign changed) of 
Dα to DΓ, where Dα is the change in the angle between the given 
directions ξ, η caused by the parallel displacement in $V_n$ of one of 
these directions round an infinitesimal circuit of area DΓ, belonging 
to the section ξ, η, and described in the sense ξ → η. We therefore 
have 

$$K = -\frac{D\alpha}{D\Gamma}, \qquad (30)$$

as in the $V_2$. 

The explicit expression for K corresponding to this can be 
obtained from (27'), and we get 

$$K = \frac{1}{\sin^2\alpha} \sum_{ijhk} (ij, hk)\,u^i\,v^j\,u^h\,v^k. \qquad (31)$$

The symmetry of the right-hand side in u and v provides 
formal confirmation of the fact that it is immaterial whether 
we displace u or v, as we get in either case the same value for 
Dα and the same value for K. 

A third definition of the Riemannian curvature can be 
obtained from the proof of the following lemma. 

Take any three points P, P', P'' on a surface $V_2$, and join 
them in pairs by arcs of geodesics, forming a so-called geodesic 
triangle. If φ, φ', φ'' are the angles of this triangle, the quantity 

ε = φ + φ' + φ'' - π (32) 

is called the geodesic excess. 

Now take any point on the triangle, and give a parallel displacement 
round the triangle to the direction of the side at 
that point, or of one of the sides if the point is a vertex; e.g. 
starting from P, the direction u at P of the side PP' (taken in 
the sense P → P'). 

We want to show that the angle between the initial and 
final positions of u (measured from the initial position in the 
sense which at each vertex is dependent on the sense PP'P'' 
in which the circuit is described), i.e. the angle of parallelism 
(relative in this case to a circuit of a special kind, but without 
the restriction of being infinitesimal), is the same as the geodesic 
excess ε. 

For the proof we shall follow u in a cyclic displacement 
round the circuit PP'P'', noting in the first place that from 
P to P' u remains tangential to the side PP', on account of the 
autoparallelism of geodesics (cf. Chapter V, p. 101). On arriving 
at P', u is thus inclined at an angle π - φ' (outside the triangle 
at P') to the side P'P'' (in the sense indicated by these letters); 
more precisely, u is behind the tangent to the side P'P'' by an 
angle of π - φ', the sense of rotation at P', as we have already 
said, being determined by the sense of description PP'P''. In 
the displacement from P' to P'' this angle remains unchanged; 
at P'' there will be a further loss of π - φ'' (with respect to 
the new side P''P); and finally at P yet another loss of π - φ 
(with respect to PP'). Taking all these together, we see that 
in its final position the parallel to u has been rotated away from 
its initial position through an angle of 3π - (φ + φ' + φ'') 
in the negative direction, or ε - 2π in the positive direction. 
Now in the pencil of directions at a point an angle is determined 
geometrically by a quantity of the form θ + 2nπ, where n is any 
integer; hence we have proved that one value of the angle of 
parallelism is the geodesic excess ε. Now in a Euclidean space 
the geodesic excess is zero, and in any manifold whatever, for 
an infinitesimal triangle, the excess is infinitesimal; we therefore 
see that for reasons of continuity the value adopted for the 
angle of parallelism is in fact the most suitable, being that which 
tends to zero with the triangle. 

The lemma just proved is rigorously true in a $V_2$ whatever 
may be the magnitude of the geodesic triangle considered. If 
in particular we apply it to an infinitesimal triangle, we can 
substitute ε for -Dα in (29), so getting 

$$K = \frac{\epsilon}{D\Gamma},$$

a formula which defines the Riemannian curvature as the ratio 
of the geodesic excess to the area of an infinitesimal geodesic triangle 
lying in the section considered, and having one vertex at the given 
point P. It will be seen that this is an obvious extension to 
n-dimensional manifolds with any metric of an elementary theorem 
in spherical geometry (the area of a spherical triangle = the 
spherical excess × the square of the radius); the latter 
theorem, however, unlike the former, holds also for a finite 
triangle. 
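The elementary theorem of spherical geometry quoted above can be checked numerically. In the sketch below the excess is computed from the three angles of the triangle, while the area is computed independently from the solid angle subtended at the centre; that solid-angle formula (the Van Oosterom and Strackee expression) is an outside ingredient, not taken from the text, and all function names are illustrative.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def vertex_angle(a, b, c):
    """Angle at a between the great-circle arcs ab and ac (unit vectors)."""
    tb = [b[i] - dot(a, b) * a[i] for i in range(3)]  # tangent at a towards b
    tc = [c[i] - dot(a, c) * a[i] for i in range(3)]  # tangent at a towards c
    return math.acos(dot(tb, tc) / math.sqrt(dot(tb, tb) * dot(tc, tc)))


def geodesic_excess(a, b, c):
    """Sum of the three angles minus pi, as in (32)."""
    return (vertex_angle(a, b, c) + vertex_angle(b, c, a)
            + vertex_angle(c, a, b) - math.pi)


def triangle_area(a, b, c, radius):
    """Area from the solid angle subtended at the centre of the sphere."""
    triple = abs(dot(a, cross(b, c)))
    solid_angle = 2.0 * math.atan2(triple, 1.0 + dot(a, b) + dot(b, c) + dot(c, a))
    return radius ** 2 * solid_angle


# One octant of a sphere of radius 2: three right angles.
a, b, c = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)
radius = 2.0
print(geodesic_excess(a, b, c) / triangle_area(a, b, c, radius), 1.0 / radius ** 2)
```

The ratio of excess to area equals the reciprocal of the square of the radius, which is the curvature of the sphere, and here the triangle is finite rather than infinitesimal.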

We must confine ourselves to a mere reference to the 
important researches of Professors Schouten¹ and Bompiani² 
on the simultaneous cyclic displacement of several directions, 
and even of the whole sheaf of directions drawn from a single 
point. Their work throws light on the theory of Riemannian 
curvature under various new aspects. 

11. Case of a $V_3$. The tensors $\alpha_{ik}$ of Ricci and $G_{ik}$ of Einstein. 

For a manifold of three dimensions the symbols of the first 
species (ij, hk) which do not vanish reduce substantially, in virtue 
of (10), (11), and (14), to the scheme 

(i+1 i+2, k+1 k+2) (i, k = 1, 2, 3), 

¹ See the references given in note (1), p. 172. 

² "Studi sugli spazi curvi", in Atti del R. Ist. Veneto, Vol. LXXX, 1921, pp. 355-386, 
839-859, 1113-1145. 
with the convention (p. 161, § 9) of regarding as equivalent 
two indices which differ by a multiple of 3. 

Using this notation, we now introduce after Ricci the double 
symmetric system 

$$\alpha^{ik} = \frac{(i+1\ i+2,\ k+1\ k+2)}{a} \qquad (i, k = 1, 2, 3), \qquad (33)$$

which constitutes, as we shall now show, a contravariant tensor. 
In fact, if we make use of the contravariant system ε (Chapter 
VI, p. 158) we see that (33) is equivalent to 

$$\alpha^{ik} = \frac{1}{4} \sum_{pqrs} \epsilon^{ipq}\,\epsilon^{krs}\,(pq, rs) \qquad (i, k = 1, 2, 3), \qquad (33')$$

which proves the assertion. The verification of this last formula 
is immediate, when we remember that of the various determinations 
a priori possible for the pair p, q, there are only two, 

p = i + 1, q = i + 2 

and p = i + 2, q = i + 1, 

corresponding to which $\epsilon^{ipq}$ has a value different from zero, 
viz. $1/\sqrt{a}$ in the former case, $-1/\sqrt{a}$ in the latter. Similarly, for 
the other pair r, s, the only determinations to which there corresponds 
a non-vanishing $\epsilon^{krs}$ are 

r = k + 1, s = k + 2 

and r = k + 2, s = k + 1. 

The sum reduces therefore to four terms all expressible as 

$$\frac{1}{4}\,\frac{(i+1\ i+2,\ k+1\ k+2)}{a},$$

thus giving the result stated in (33). 

The name Ricci's symbols is sometimes given to the $\alpha^{ik}$ 
just defined, or, more specifically, to the elements 

$$\alpha_{ik} = \sum_{jh} a_{ij}\,a_{kh}\,\alpha^{jh} \qquad (i, k = 1, 2, 3) \qquad (34)$$

of the reciprocal covariant tensor. 
It is worth noting that equations (33), if we take account of
the properties of Riemann's symbols in respect of symmetry
and antisymmetry, can be solved so as to give the values of these
symbols in the form

$$(ij, hk) = \sum_{\nu,\rho=1}^{3} \epsilon_{\nu ij}\, \epsilon_{\rho hk}\, \alpha^{\nu\rho}. \tag{33''}$$

In fact, expanding the second member and attending to the
definition of the covariant system $\epsilon$, we find the value zero
corresponding to every set of four $i, j, h, k$ for which the first member
vanishes, and the value $a\,\alpha^{ik}$ corresponding to a set of four of the
type $i+1,\ i+2,\ k+1,\ k+2$; (33'') therefore follows from (33).

By contracting the Ricci tensor $\alpha$ by means of the fundamental
tensor (the coefficients of $ds^2$ or their reciprocals) we
obtain the linear invariant of the tensor $\alpha$

$$\mathcal{A} = \sum_{i,k=1}^{3} a_{ik}\, \alpha^{ik} = \sum_{i,k=1}^{3} a^{ik}\, \alpha_{ik}. \tag{35}$$

We may point out another formal relation of which use will be
made in Chapter XII. For any $V_n$ whatever we can derive from
the Riemannian tensor, by contraction with respect to two
indices, the covariant double tensor

$$G_{ik} = \sum_{h=1}^{n} \{ih, hk\}, \tag{36}$$

which, in virtue of equations (5), can also be written in the
form

$$G_{ik} = \sum_{j,h=1}^{n} a^{jh}\, (ij, hk). \tag{36'}$$

This tensor was noticed by Ricci, who applied it to the study of
the local distribution of curvatures in a $V_n$; it was afterwards
taken up by Einstein, who gave it a fundamental place in the
theory of relativity (in which $n = 4$): it is commonly known
as the Einstein tensor.

For $n = 3$ the $\alpha_{ik}$'s are related in a simple way to the $G_{ik}$'s.
To bring out the connexion simply and neatly, it is convenient
to make use of two properties of the ternary systems $\epsilon$: one
expressed by the identity (which can be verified immediately)

$$\sum_{i=1}^{3} \epsilon^{ipq}\, \epsilon_{ris} = \delta^p_s\, \delta^q_r - \delta^p_r\, \delta^q_s \qquad (p, q, r, s = 1, 2, 3), \tag{37}$$

the $\delta$'s having the usual meaning (0 for different indices, 1 for
equal indices); and the other translating, as it were, the definition
of $a^{jh}$ as a reciprocal element

$$a^{jh} = \frac{1}{2} \sum_{p,q,r,s=1}^{3} \epsilon^{jpq}\, \epsilon^{hrs}\, a_{pr}\, a_{qs}.$$

Substitute in equations (36'), in place of the $a^{jh}$'s, the second
member here; and, in place of the $(ij, hk)$'s, the expression for
them in (33''), with $\epsilon_{\nu ij}$ changed into $-\epsilon_{\nu ji}$. Taking account of
(37) we find

$$G_{ik} = -\frac{1}{2} \sum_{p,q,r,s,\nu,\rho=1}^{3} a_{pr}\, a_{qs}\, \alpha^{\nu\rho}\, (\delta^p_i \delta^q_\nu - \delta^p_\nu \delta^q_i)(\delta^r_k \delta^s_\rho - \delta^r_\rho \delta^s_k).$$

Of the four terms obtained by expanding the product of the
bracketed expressions, the two which are positive are merely
interchanged by interchange of $p$ with $q$ and of $r$ with $s$, which
does not alter the product $a_{pr}\, a_{qs}$; and similarly with the negative
terms. We can therefore confine our attention to one term only
of each kind, and suppress the factor $\frac{1}{2}$. If we take, e.g.

$$\delta^p_i\, \delta^q_\nu\, (\delta^r_k \delta^s_\rho - \delta^r_\rho \delta^s_k),$$

and bear in mind the meaning of the symbols $\delta$, we find that the
sum reduces to one with respect to $\nu$ and $\rho$ only, giving

$$G_{ik} = -\sum_{\nu,\rho=1}^{3} (a_{\nu\rho}\, a_{ik} - a_{i\rho}\, a_{k\nu})\, \alpha^{\nu\rho},$$

or finally, having regard to (34), (35),

$$G_{ik} = \alpha_{ik} - \mathcal{A}\, a_{ik} \qquad (i, k = 1, 2, 3). \tag{38}$$
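
The relation (38) lends itself to a quick numerical check. The following Python sketch is an illustration only and forms no part of the original argument: the metric `a` and the symmetric system `alpha_up` are arbitrary sample values assumed here, and the helper names (`perm_sign`, `det3`, `inv3`, `eps_cov`, `riemann`) are hypothetical. It builds the symbols (ij, hk) from (33''), contracts them as in (36'), and compares the result with the second member of (38); it also confirms that (33'') reproduces (33).

```python
from itertools import product
from math import sqrt

def perm_sign(i, j, k):
    """Sign of the permutation (i, j, k) of (0, 1, 2); 0 if an index repeats."""
    return (i - j) * (j - k) * (k - i) // 2

def det3(m):
    """Determinant of a 3x3 matrix."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def inv3(m):
    """Inverse of a 3x3 matrix by cofactors."""
    d = det3(m)
    inv = [[0.0] * 3 for _ in range(3)]
    for i, j in product(range(3), repeat=2):
        rows = [r for r in range(3) if r != j]
        cols = [c for c in range(3) if c != i]
        minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                 - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
        inv[i][j] = (-1) ** (i + j) * minor / d
    return inv

# Sample data (assumed for illustration): a positive definite metric a_ik
# and an arbitrary symmetric contravariant system alpha^ik.
a = [[2.0, 0.3, 0.1], [0.3, 1.5, 0.2], [0.1, 0.2, 1.2]]
alpha_up = [[1.0, 0.4, -0.2], [0.4, 0.5, 0.3], [-0.2, 0.3, -0.7]]

det_a = det3(a)
a_inv = inv3(a)
R3 = range(3)
PAIRS = list(product(R3, repeat=2))

def eps_cov(p, i, j):
    """Covariant epsilon system: sqrt(a) times the permutation sign."""
    return sqrt(det_a) * perm_sign(p, i, j)

def riemann(i, j, h, k):
    """Symbols (ij, hk) built from alpha^{nu rho} as in (33'')."""
    return sum(eps_cov(n, i, j) * eps_cov(r, h, k) * alpha_up[n][r]
               for n, r in PAIRS)

# Covariant alpha_ik as in (34) and the linear invariant as in (35).
alpha_low = [[sum(a[i][j] * a[k][h] * alpha_up[j][h] for j, h in PAIRS)
              for k in R3] for i in R3]
invariant = sum(a[i][k] * alpha_up[i][k] for i, k in PAIRS)

# G_ik by contraction as in (36').
G = [[sum(a_inv[j][h] * riemann(i, j, h, k) for j, h in PAIRS)
      for k in R3] for i in R3]

# Largest deviation from (38): G_ik = alpha_ik - invariant * a_ik.
max_err_38 = max(abs(G[i][k] - (alpha_low[i][k] - invariant * a[i][k]))
                 for i, k in PAIRS)

# Consistency with (33): (i+1 i+2, k+1 k+2) = a * alpha^ik, indices mod 3.
max_err_33 = max(abs(riemann((i + 1) % 3, (i + 2) % 3, (k + 1) % 3, (k + 2) % 3)
                     - det_a * alpha_up[i][k])
                 for i, k in PAIRS)

print(max_err_38, max_err_33)
```

Both printed deviations should be at the level of rounding error; any other positive definite `a` and symmetric `alpha_up` may be substituted.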


12. Curvature of a manifold of three dimensions around a 
point. Principal directions and invariants. 

Consider in a $V_3$ a generic section or facet, $f$, defined by two
directions (versors) u, v, whose parameters are $u^i, v^i$ $(i = 1, 2, 3)$,
issuing from the same point P. Let w be their vector product
(Chapter VI, p. 159), the moments of which are

$$w_\nu = \sum_{i,j=1}^{3} \epsilon_{\nu ij}\, u^i\, v^j \qquad (\nu = 1, 2, 3). \tag{39}$$

Corresponding to the section $f$ in our manifold we have the
Riemannian curvature $K$ given by (31), with $n = 3$. In the
sum $\sum_{ijhk}$ of (31) it is convenient to introduce, in place of
Riemann's symbols $(ij, hk)$, the expressions for them given in (33'').
Taking account of (39), we find immediately

$$K = \frac{1}{w^2} \sum_{\nu,\rho=1}^{3} \alpha^{\nu\rho}\, w_\nu\, w_\rho. \tag{40}$$


Deferring for a moment the illustration of this formula, we recall
the fact that in general the length $w$ of the vector u ∧ v (the
vector product of u and v) is given by the product of the lengths
of u, v by the sine of the angle between them. For the Euclidean
$V_3$ this implies, as already mentioned (Chapter VI, p. 169), the
identity of the moments (39) referred to Cartesian co-ordinates with
the ordinary components (orthogonal projections) of the vector
product u ∧ v. For a general $V_3$, it is sufficient to remember
that we can choose co-ordinates which are locally Cartesian at
any assigned point P, so that the $a_{ik}$ have the values $\delta_i^k$ (and
so that also, though this is not important for the present purpose,
their first derivatives vanish). Now both in the measures
of the lengths and the angular distances of vectors proceeding
from the point P, and in the definition (39) of the $w_\nu$'s, there
enter only the components of the vectors and the values of the
$a_{ik}$'s at P. Locally then everything is the same as for Euclidean
space.^

Turn now to our case, in which u and v are unit vectors. It
follows that $w = \sin\alpha$, whence

$$\lambda_\nu = \frac{w_\nu}{w} = \frac{w_\nu}{\sin\alpha}$$

constitute the moments of the direction λ normal to the section $f$. The sense
assigned to the normal by (39) is characterized by the system
$\epsilon$ or, geometrically, by the sense of the trihedron formed at P
by the positive directions of the co-ordinate lines. In fact, from
these equations (39), if we suppose, for example, that the lines
1, 2 are taken in the directions u, v, there results

$$w_1 = w_2 = 0, \qquad w_3 = \frac{\sqrt{a}}{\sqrt{a_{11}\, a_{22}}} > 0,$$

so that (Chapter V, p. 127) w makes an acute angle with the
positive direction of the line 3, whose parameters are

$$b^1 = b^2 = 0, \qquad b^3 = \frac{1}{\sqrt{a_{33}}} > 0.$$

Thus λ is perpendicular to $f$ at P, and directed so that the
trihedron u, v, λ has the same sense as the trihedron formed
by the positive directions of the co-ordinate lines at the same
point P.

^ We can also of course calculate the length of w. Thus we note that, besides
(39), we have the equivalent formulae for the contravariant components:

$$w^\nu = \sum_{h,k=1}^{3} \epsilon^{\nu hk}\, u_h\, v_k.$$

Hence, in view of (37) (the index of summation in which we may suppose
transferred to the first place),

$$w^2 = \sum_{\nu=1}^{3} w_\nu\, w^\nu = \sum_{i,j,h,k=1}^{3} (\delta_i^h \delta_j^k - \delta_i^k \delta_j^h)\, u^i\, v^j\, u_h\, v_k$$

$$= \sum_{i} u^i u_i \sum_{j} v^j v_j - \sum_{i} u^i v_i \sum_{j} v^j u_j$$

$$= u^2 v^2 - (uv \cos\alpha)^2 = u^2 v^2 \sin^2\alpha.$$

Equation (40) now takes the form, given by Ricci,

$$K = \sum_{\nu,\rho=1}^{3} \alpha^{\nu\rho}\, \lambda_\nu\, \lambda_\rho = \sum_{\nu,\rho=1}^{3} \alpha_{\nu\rho}\, \lambda^\nu\, \lambda^\rho, \tag{40'}$$

which defines the curvature of a variable section $f$ through P as a
homogeneous quadric in the parameters $\lambda^\nu$ (or in the moments $\lambda_\nu$)
of the normal to $f$, the sense of the normal being indifferent, since
the expressions in (40') do not change when we change the signs
of the λ's.

The dependence of $K$ on the direction λ of the section with
centre at P is of purely local nature. We can therefore, in
accordance with an observation made above, make use of the
elementary criteria of analytical geometry just as if it were a question
of ordinary space: we have only to take co-ordinates which are
Cartesian and orthogonal at P. The $\lambda^\nu$'s then become direction
cosines, and we have for $K$ (except for a different signification
of the coefficients $\alpha_{\nu\rho}$) the same expression as the one which
characterizes the distribution of moments of inertia (of an assigned
material system) with respect to the $\infty^2$ axes coinciding with the
lines of the versors λ proceeding from P. As we know, the law
of variation of $K$ becomes expressible geometrically if we introduce
the ellipsoid of inertia $E$, of centre P: to any direction of
λ there belongs a value of $K$, viz. $1/\overline{PQ}^{\,2}$, where Q is the intersection
of the line of λ with the ellipsoid $E$; the three axes of $E$
correspond to stationary values of $K$ (with respect to neighbouring
directions), the axes of maximum and minimum length in
particular corresponding respectively to minimum and maximum
values of $K$.


That being so, the same law of variation will hold for $K$ the
curvature; but in the general case an interpretation is necessary.
This implies merely that the ellipsoid $E$ (like, for example,
Dupin's indicatrix for the case of an ordinary surface) takes us
in thought outside the $V_3$, as an auxiliary representative element
to be associated with the Euclidean three-dimensional space
which is tangential to the $V_3$ at P when, as is always allowable
(Chapter V, p. 121), we imagine the $V_3$ immersed in a Euclidean
$S_N$ $(N \ge 6)$.

The outstanding result is that there exist at every point P at
least three mutually orthogonal directions $\lambda_1, \lambda_2, \lambda_3$ to which (or
rather to the normal sections perpendicular to which) belong
curvatures which are stationary with respect to those of adjacent
sections. These directions are called principal directions of
curvature, and the corresponding values $\omega_1, \omega_2, \omega_3$ of $K$ principal
curvatures. In general, that is when the three ω's are distinct,
the principal directions are uniquely determinate (ellipsoid with
three unequal axes); when two principal curvatures are equal
but differ from the third (ellipsoid of revolution), e.g.
$\omega_1 = \omega_2 \neq \omega_3$, only the principal direction $\lambda_3$ is uniquely determinate,
while every pair of directions $\lambda_1, \lambda_2$ orthogonal to $\lambda_3$ and to each
other can be considered principal. If the three principal
curvatures coincide (sphere) the curvature $K$ is the same for all sections,
and every set of three mutually orthogonal directions is a
principal set.

All this of course can be established by purely algebraic
methods: we have only to avail ourselves of the theory of
quadratic forms and their transformations. Let

$$\varphi = \sum_{i,k=1}^{n} a_{ik}\, \lambda^i \lambda^k, \qquad \psi = \sum_{i,k=1}^{n} \alpha_{ik}\, \lambda^i \lambda^k \tag{41}$$

be two quadrics, one of which at least is definite, say $\varphi$; the
independent variables being $\lambda^i$ $(i = 1, 2, \ldots, n)$. Then the following
facts are well-known:^

(1) If we consider the ratio

$$\omega = \frac{\psi}{\varphi}$$

as a function of the λ's, and look for the values of these variables
which make $\delta\omega = 0$, we are led to the system of $n$ linear
homogeneous equations

$$\sum_{j=1}^{n} (\alpha_{ij} - \omega\, a_{ij})\, \lambda^j = 0 \qquad (i = 1, 2, \ldots, n); \tag{42}$$

these are satisfied by values not all null of the λ's if and only
if the determinant of the coefficients vanishes, so that the ω's
are roots of the equation of degree $n$:

$$\| \alpha_{ik} - \omega\, a_{ik} \| = 0, \tag{43}$$

called the characteristic equation.

(2) If $\lambda^i$, $\lambda'^i$ are two solutions of equations (42) corresponding
to two distinct roots $\omega$, $\omega'$ of the characteristic equation, there
exists between them the relation (of orthogonality)

$$\sum_{i,k=1}^{n} a_{ik}\, \lambda^i\, \lambda'^k = 0.$$

(3) The characteristic equation (43) has its $n$ roots $\omega_1, \omega_2, \ldots, \omega_n$
all real (distinct or coincident).

(4) To each simple root $\omega_h$ of (43) corresponds, in the manifold
$V_n$ the $ds^2$ of which has the $a_{ik}$'s as its coefficients, one and
only one direction whose parameters satisfy (42) for $\omega = \omega_h$.
With each root of multiplicity $\mu$ $(> 1)$ we can in an infinite
number of ways associate $\mu$ mutually orthogonal directions in
$V_n$, whose parameters are independent solutions of equations
(42) when we give $\omega$ the value of the said multiple root.
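
Facts (1) to (3) can be illustrated numerically in the simplest case $n = 2$, where the characteristic equation (43) is a quadratic. The Python sketch below is only an illustration under stated assumptions: the coefficient tables `a` and `alpha` are sample values chosen here, and the helper names `quad` and `direction` are hypothetical. It solves (43), obtains a direction for each root from the first equation of (42), and checks the orthogonality relation of (2) together with the value of the ratio ψ/φ on each direction.

```python
from math import sqrt

# Sample coefficients (assumed for illustration only).
a = [[2.0, 0.5], [0.5, 1.0]]        # a_ik: positive definite form phi
alpha = [[1.0, 0.3], [0.3, -0.5]]   # alpha_ik: second quadric psi

def quad(m, x, y):
    """Bilinear form sum over i, k of m_ik x^i y^k."""
    return sum(m[i][k] * x[i] * y[k] for i in range(2) for k in range(2))

# Characteristic equation (43) for n = 2: A w^2 + B w + C = 0.
A = a[0][0] * a[1][1] - a[0][1] ** 2
B = -(alpha[0][0] * a[1][1] + alpha[1][1] * a[0][0]
      - 2 * alpha[0][1] * a[0][1])
C = alpha[0][0] * alpha[1][1] - alpha[0][1] ** 2
disc = B * B - 4 * A * C             # non-negative, in agreement with fact (3)
omegas = [(-B + sqrt(disc)) / (2 * A), (-B - sqrt(disc)) / (2 * A)]

def direction(w):
    """Solve the first equation of (42) and normalize so that phi = 1."""
    m00 = alpha[0][0] - w * a[0][0]
    m01 = alpha[0][1] - w * a[0][1]
    lam = [-m01, m00]                # assumes this row is not identically zero
    norm = sqrt(quad(a, lam, lam))
    return [lam[0] / norm, lam[1] / norm]

lam1, lam2 = direction(omegas[0]), direction(omegas[1])

# Fact (2): directions belonging to distinct roots are orthogonal.
orthogonality = quad(a, lam1, lam2)

# On each direction the ratio psi / phi takes the corresponding root as value.
ratio_errors = [abs(quad(alpha, l, l) / quad(a, l, l) - w)
                for l, w in ((lam1, omegas[0]), (lam2, omegas[1]))]

print(omegas, orthogonality, ratio_errors)
```

For larger $n$ the same check would require a general eigenvalue routine; the closed form used here is special to the quadratic case.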

From all this there results that it is possible in every case
to set up at least one set of $n$ mutually orthogonal directions
$\lambda_h$ $(h = 1, 2, \ldots, n)$ (uniquely determined in the general case,
in which the roots of (43) are all simple), the parameters of
which satisfy the system (42), with $\omega$ equal to the corresponding
$\omega_h$.

^ For proofs, see for example: Bianchi, Lezioni di geometria analitica,
Appendix (Pisa: Spoerri, 1915); or Bromwich, Quadratic Forms and their
Classification . . . (Cambridge Tracts on Mathematics ... No. 3).

By means of these $\lambda_h$'s we can obtain the so-called canonical
expressions. We begin by introducing the moments

$$\lambda_{h|i} = \sum_{j=1}^{n} a_{ij}\, \lambda_h^j \qquad (h, i = 1, 2, \ldots, n), \tag{44}$$

and observe that, by associating with the $n$ identities

$$\sum_{i=1}^{n} \lambda_h^i\, \lambda_{h|i} = 1 \qquad (h = 1, 2, \ldots, n)$$

the $\frac{1}{2}n(n-1)$ conditions of orthogonality of two different
directions $\lambda_h$, $\lambda_k$ of the n-ple set

$$\sum_{i=1}^{n} \lambda_h^i\, \lambda_{k|i} = 0 \qquad (h \neq k),$$

we obtain altogether the relations

$$\sum_{i=1}^{n} \lambda_h^i\, \lambda_{k|i} = \delta_h^k \qquad (h, k = 1, 2, \ldots, n),$$

which express the noteworthy fact that the $n^2$ parameters $\lambda_h^i$
of an n-ple orthogonal set are the elements reciprocal (in an algebraic
sense) to the $n^2$ moments in the determinant $\|\lambda_{h|i}\|$ which they form;
and vice versa that the moments are the reciprocals of the parameters
in the corresponding determinant $\|\lambda_h^i\|$ (cf. Chapter IV, p. 74).

Further, besides the relations just written which refer to the
columns, their analogues with respect to the rows also hold good;
these may be written

$$\sum_{h=1}^{n} \lambda_h^i\, \lambda_{h|k} = \delta^i_k \qquad (i, k = 1, 2, \ldots, n). \tag{45}$$

Taking account of this, if we multiply (44) by $\lambda_{h|k}$ and sum
with respect to $h$, there results immediately

$$a_{ik} = \sum_{h=1}^{n} \lambda_{h|i}\, \lambda_{h|k} \qquad (i, k = 1, 2, \ldots, n), \tag{46}$$

which are expressions for the fundamental tensor $a_{ik}$ in terms of
the moments of any orthogonal n-ple set.
Consider in particular the n-ple set (or one of the n-ple sets)
$\lambda_h$, the parameters of which, $\lambda_h^i$, satisfy (42) where, for each index
$h$, $\omega = \omega_h$. These equations, in virtue of (44), may be written

$$\sum_{j=1}^{n} \alpha_{ij}\, \lambda_h^j = \omega_h\, \lambda_{h|i} \qquad (h, i = 1, 2, \ldots, n).$$

Multiplying by $\lambda_{h|k}$ and summing with respect to the index $h$,
while attending to (45), we obtain the canonical expression for
the $\alpha_{ik}$ to be associated with (46), viz.

$$\alpha_{ik} = \sum_{h=1}^{n} \omega_h\, \lambda_{h|i}\, \lambda_{h|k} \qquad (i, k = 1, 2, \ldots, n). \tag{47}$$

After this the simultaneous reduction of the two quadrics to
orthogonal form is easily effected by substituting for the original
variables the $n$ linear combinations

$$z_h = \sum_{i=1}^{n} \lambda_{h|i}\, \lambda^i \qquad (h = 1, 2, \ldots, n). \tag{48}$$

In fact, when we substitute in (41) the values (46), (47) of the
coefficients, taking account of (48), there results

$$\varphi = \sum_{h=1}^{n} z_h^2, \qquad \psi = \sum_{h=1}^{n} \omega_h\, z_h^2. \tag{49}$$

The condition that the $\lambda^i$'s should be parameters, that is to say
that the expression (41) for $\varphi$ should have the value 1, becomes
in the new variables $z$

$$\sum_{h=1}^{n} z_h^2 = 1.$$

The mode of variation of $\psi$ when the λ's are parameters of direction
is identical with that of the ratio $\psi/\varphi$ when the variables are
independent; or, if we wish, of the quadric $\psi = \sum_h \omega_h z_h^2$ when the
$z$'s are connected by the relation $\sum_h z_h^2 = 1$.

Moreover, the stationary values of $\psi$ in these cases are
precisely the roots of the characteristic equation (43).
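
The canonical expressions (46), (47) and the reduced forms (49) can also be verified numerically for $n = 2$. The sketch below is offered as an illustration only: the tables `a` and `alpha`, the trial variables `lam`, and the helper name `unit_direction` are assumptions introduced here, not quantities taken from the text. It constructs the orthogonal pair of directions from (42), forms the moments (44), and measures the deviations from (46), (47) and (49).

```python
from math import sqrt

# Sample coefficients (assumed for illustration only).
a = [[2.0, 0.5], [0.5, 1.0]]        # a_ik: positive definite form phi
alpha = [[1.0, 0.3], [0.3, -0.5]]   # alpha_ik: second quadric psi
N = range(2)

# Roots of the characteristic equation (43), a quadratic when n = 2.
A = a[0][0] * a[1][1] - a[0][1] ** 2
B = -(alpha[0][0] * a[1][1] + alpha[1][1] * a[0][0]
      - 2 * alpha[0][1] * a[0][1])
C = alpha[0][0] * alpha[1][1] - alpha[0][1] ** 2
root = sqrt(B * B - 4 * A * C)
omegas = [(-B + root) / (2 * A), (-B - root) / (2 * A)]

def unit_direction(w):
    """Parameters satisfying (42) for the root w, normalized so that phi = 1."""
    lam = [-(alpha[0][1] - w * a[0][1]), alpha[0][0] - w * a[0][0]]
    norm = sqrt(sum(a[i][k] * lam[i] * lam[k] for i in N for k in N))
    return [x / norm for x in lam]

params = [unit_direction(w) for w in omegas]          # params[h][i] = lambda_h^i
moments = [[sum(a[i][j] * params[h][j] for j in N)    # (44): lambda_{h|i}
            for i in N] for h in N]

# Deviation from (46): a_ik = sum_h lambda_{h|i} lambda_{h|k}.
err_46 = max(abs(a[i][k] - sum(moments[h][i] * moments[h][k] for h in N))
             for i in N for k in N)

# Deviation from (47): alpha_ik = sum_h omega_h lambda_{h|i} lambda_{h|k}.
err_47 = max(abs(alpha[i][k]
                 - sum(omegas[h] * moments[h][i] * moments[h][k] for h in N))
             for i in N for k in N)

# (48) and (49) for an arbitrary trial set of variables.
lam = [0.7, -0.4]
z = [sum(moments[h][i] * lam[i] for i in N) for h in N]
phi = sum(a[i][k] * lam[i] * lam[k] for i in N for k in N)
psi = sum(alpha[i][k] * lam[i] * lam[k] for i in N for k in N)
err_49 = max(abs(phi - sum(zh * zh for zh in z)),
             abs(psi - sum(w * zh * zh for w, zh in zip(omegas, z))))

print(err_46, err_47, err_49)
```

All three deviations should vanish to rounding error, whatever trial values are placed in `lam`.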

The form

$$\psi = \sum_{i,k=1}^{n} \alpha_{ik}\, \lambda^i \lambda^k$$

is an obvious generalization, for any value of $n$, of the expression
(40') for $K$, which defines the local distribution of curvature in
a $V_3$.

The above considerations manifestly apply also to the 
behaviour of the curvature K when the direction of the section 
is supposed to vary, a topic which has already been discussed in 
a more elementary way. 

13. Geodesics infinitely near a given geodesic. 

We shall conclude this chapter with a discussion of the
extension to $n$ dimensions of a classical formula due to Jacobi,
which defines in a very simple way the aggregate of those
geodesics $g$ of a surface which are infinitely near a given geodesic
$B$, called the geodesic base. Jacobi gives the linear equation

$$\frac{d^2 y}{d\sigma^2} + K(\sigma)\, y = 0, \tag{J}$$

where $y$ denotes the distance (normal) of any point M of $g$ from
the base, $\sigma$ the arc of the base measured from an arbitrary origin
O up to the projection P of M upon $B$, and $K(\sigma)$ the Gaussian
curvature of the surface at P.
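
As an illustration of equation (J), the following Python sketch integrates it numerically for a surface of constant curvature $K = 1$ (the unit sphere), a case chosen here purely as an assumption for the example. The function name `integrate_jacobi` and the classical Runge-Kutta scheme are illustrative choices and are not taken from the text. With $y(0) = 0$, $y'(0) = 1$ the solution is $\sin\sigma$, so the deviation should reach 1 at $\sigma = \pi/2$ and return to zero at $\sigma = \pi$, where neighbouring geodesics issuing from a point of the sphere meet again.

```python
from math import pi

def integrate_jacobi(K, y0, dy0, sigma_end, steps=2000):
    """Integrate y'' + K(sigma) y = 0 from 0 to sigma_end by classical RK4.

    K is a function of the arc sigma; y0 and dy0 are the initial deviation
    and its derivative. Returns the deviation y at sigma_end.
    """
    h = sigma_end / steps
    s, y, v = 0.0, y0, dy0
    for _ in range(steps):
        k1y, k1v = v, -K(s) * y
        k2y, k2v = v + 0.5 * h * k1v, -K(s + 0.5 * h) * (y + 0.5 * h * k1y)
        k3y, k3v = v + 0.5 * h * k2v, -K(s + 0.5 * h) * (y + 0.5 * h * k2y)
        k4y, k4v = v + h * k3v, -K(s + h) * (y + h * k3y)
        y += h * (k1y + 2 * k2y + 2 * k3y + k4y) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
        s += h
    return y

def unit_curvature(sigma):
    """Gaussian curvature along the base for the unit sphere (assumed case)."""
    return 1.0

y_quarter = integrate_jacobi(unit_curvature, 0.0, 1.0, pi / 2)
y_half = integrate_jacobi(unit_curvature, 0.0, 1.0, pi)
print(y_quarter, y_half)
```

Replacing `unit_curvature` by any other function of the arc gives the deviation along a base on which the curvature varies.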

(J) is simply, in Poincaré's phrase, the equation of variations
of geodesics starting from $B$. There can be deduced from it, as
we know,^ some very remarkable consequences with respect to
the behaviour of geodesics in the immediate neighbourhood of
the base, the nature of the surface intervening only through its
total curvature $K$. This is obviously an intrinsic question,
depending entirely on the metric of the surface (as defined by
its $ds^2$) and not at all on the different configurations which the
surface can present in space.

^ See, for example, Darboux, Théorie des surfaces, Vol. III (Paris, Gauthier-
Villars, 1894; new impression 1923), Chap. V; or Blaschke, Vorlesungen über
Differentialgeometrie, Vol. I (2nd edition, Berlin, Springer, 1924), §§ 83-88.

It naturally suggests itself that we should try to extend the
study of this subject of geodesic deviation to a Riemannian
manifold of any number of dimensions. We have long had,
of course, the equations of Lagrange defining the geodesics of a
$V_n$ in a form the convenience of which, whether from the point
of view of theory or of notation, is all that could be desired.
These equations we may therefore use for the purpose of forming
the equations of variations. Then with the help of Bianchi's idea
(Chapter V, p. 139) of the derivative of a vector attached to
the points of a line of $V_n$, we reduce these equations to a condensed
form, geometrically suggestive and of course invariant. The actual
construction of the equations (linear of the second order, $n$ in
all, with the same number of unknowns) requires no further
data than the base $B$ and the metric of the manifold (especially
Riemann's symbols) along that curve.

We find that this system of equations admits a linear first
integral, which in its turn leads to a linear relation in finite terms
among the unknowns. We are thus left with a system of $n - 1$
equations or, in the special case of an ordinary surface, with a
single equation, as in Jacobi's classical result.

To bring the final system of equations to as simple a form
as possible we have to make a suitable choice of variables. Now
we have already seen (§ 11, p. 164) that it is possible, beginning
with any co-ordinates $x$, to define new co-ordinates $y$ in
terms of which $ds^2$ becomes locally Cartesian, in the sense that
the derivatives of the coefficients $a_{ik}$ all vanish at an assigned
point O. We have also seen (Chapter VI, p. 167, footnote) that it
is possible, any curve $B$ being given, to choose co-ordinates $y$
for which $ds^2$ has this locally Cartesian character at every point
of $B$. When $B$ is a geodesic, the system of co-ordinates $y$ which
(§ 17) will finally be used has the following properties, which we
merely state here without proof.^ Let M be any point in the
immediate neighbourhood of $B$, P the orthogonal projection of
M upon $B$. Then $y_n$ is the arc of the base $B$ measured from an
arbitrary origin O up to P; the $y_\alpha$'s $(\alpha = 1, 2, \ldots, n - 1)$ may
be regarded as components of the elementary vector PM in
$n - 1$ directions mutually perpendicular and all perpendicular
to $B$, chosen arbitrarily at O and carried by parallelism
along $B$.

^ For a complete development of this point, compare the paper "Sur l'écart
géodésique" in Mathematische Annalen (Vol. 97).

14. Geodesic deviation in an n-dimensional manifold.

Consider, along with the geodesic $B$, any other geodesic $g$
(more precisely, an arc of $g$) belonging to the immediate
neighbourhood of $B$. Corresponding to every point M of $g$, take the
point P of $B$ having the same $y_n$ as M and the rest of its $y$'s $= 0$.
It is important for what follows to fix precisely the relation
between an elementary arc $ds$ of $g$ and the corresponding arc
$d\sigma = dy_n$ of $B$. Throughout the neighbourhood of $B$ we have,
for the coefficients $b_{ik}$ of $ds^2$ in terms of the co-ordinates $y$, the
Euclidean values

$$b_{ik} = 0 \quad (i \neq k), \qquad b_{ii} = 1,$$

neglecting quantities of the second order.
It follows that for any curve whatever

$$ds = dy_n \sqrt{1 + \sum_{\alpha=1}^{n-1} \left( \frac{dy_\alpha}{dy_n} \right)^2},$$

the quantity under the radical differing from its exact value only
by terms of the second order. For $g$ both the $y_\alpha$ and the $\frac{dy_\alpha}{dy_n}$
may be regarded as infinitely small. It follows that

$$\frac{ds}{dy_n} = \frac{ds}{d\sigma} = 1$$

to a second approximation, a generalization of the elementary
fact that a segment infinitely near (with respect to direction)
a given right line differs from its projection on the line by an
infinitesimal of the second order.
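
The second-order character of the difference between $ds$ and $d\sigma$ can be seen in a small numerical experiment. In the Python sketch below, which is an illustration only, the slopes $dy_\alpha/dy_n$ are given the assumed sample values $(\epsilon, 2\epsilon)$ for a manifold with $n = 3$, and the function name `arc_ratio` is hypothetical. The excess of $ds/d\sigma$ over unity, divided by $\epsilon^2$, should approach the constant $5/2$ as $\epsilon$ decreases, showing that the excess is of the second order in the slopes.

```python
from math import sqrt

def arc_ratio(slopes):
    """ds / dy_n for a curve whose transverse slopes dy_alpha / dy_n are given."""
    return sqrt(1.0 + sum(p * p for p in slopes))

# Assumed sample slopes (eps, 2 * eps) for decreasing eps.
scaled_excess = {}
for eps in (1e-1, 1e-2, 1e-3):
    excess = arc_ratio((eps, 2 * eps)) - 1.0
    scaled_excess[eps] = excess / eps ** 2   # tends to (1 + 4) / 2 = 2.5

print(scaled_excess)
```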


15. Invariant form of the equations defining geodesic deviation. 

We have now to form the equations defining in general
co-ordinates the behaviour of any geodesic $g$ infinitely near^ $B$.
Put

$$x_i = \varphi_i(\sigma) + \xi^i, \tag{50}$$

where the $\xi^i$'s and their derivatives with respect to $\sigma$ are infinitely
small.

The $\xi^i$'s represent the increments of the co-ordinates of
a point P of $B$ which passes to a corresponding point M of $g$:
they can be regarded as the contravariant components of the
elementary vector PM = ξ. Adopting a more general point of
view, we shall now regard this vector as not necessarily
perpendicular to $B$, its orientation depending on the law of
correspondence between the points P and M of our two geodesics.
We can therefore no longer assume that the elementary arc
$ds$ of $g$ is (as in § 14) equal to the corresponding arc $d\sigma$ of $B$;
but, the displacement in question being always infinitely small,
we can foresee all the same that, if we put

$$\frac{ds}{d\sigma} = 1 + \lambda, \tag{51}$$

the elongation (or coefficient of dilatation) $\lambda$ remains infinitely
small with the $\xi^i$'s and their derivatives. This will be proved
formally in a moment. Meantime, differentiate the formulas (50)
with respect to $\sigma$. We have

$$\frac{ds}{d\sigma}\, \dot{x}_i = \dot{\varphi}_i + \frac{d\xi^i}{d\sigma}, \tag{50'}$$

where $\dot{x}_i$ is written for $\frac{dx_i}{ds}$ and $\dot{\varphi}_i$ for $\frac{d\varphi_i}{d\sigma}$.

^ In the strict sense, that is to say, with the understanding that not only are
changes of position of corresponding points on $B$ and $g$ to be small, but also
changes of direction of corresponding tangents.

A second differentiation gives

$$\left( \frac{ds}{d\sigma} \right)^2 \ddot{x}_i + \frac{d^2 s}{d\sigma^2}\, \dot{x}_i = \ddot{\varphi}_i + \frac{d^2 \xi^i}{d\sigma^2}.$$

From (51),

$$\frac{d^2 s}{d\sigma^2} = \frac{d\lambda}{d\sigma},$$

and (50') can be written

$$\dot{x}_i = \dot{\varphi}_i - \lambda\, \dot{x}_i + \frac{d\xi^i}{d\sigma};$$

all this holds without any assumption concerning the smallness
of $\lambda$.

Considering now $\lambda$ and $\frac{d\lambda}{d\sigma}$ as infinitesimal, and neglecting all
terms of the second order, we may replace $\frac{d^2 s}{d\sigma^2}\, \dot{x}_i$ by $\frac{d\lambda}{d\sigma}\, \dot{\varphi}_i$,
and consequently take the equation for $\ddot{x}_i$ to be

$$\left( \frac{ds}{d\sigma} \right)^2 \ddot{x}_i = \ddot{\varphi}_i - \frac{d\lambda}{d\sigma}\, \dot{\varphi}_i + \frac{d^2 \xi^i}{d\sigma^2}. \tag{50''}$$
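Now (Chapter V, p. 134) for any geodesic $g$ we have

$$\ddot{x}_i = -\sum_{j,h=1}^{n} \{jh, i\}\, \dot{x}_j\, \dot{x}_h \qquad (i = 1, 2, \ldots, n).$$

Multiply these equations by $\left( \frac{ds}{d\sigma} \right)^2$, and make the following
substitutions: (50) in the $\{jh, i\}$, (50') in the $\frac{ds}{d\sigma}\, \dot{x}_j$, (50'') in the
$\left( \frac{ds}{d\sigma} \right)^2 \ddot{x}_i$. Remembering that

$$\ddot{\varphi}_i = -\sum_{j,h=1}^{n} \{jh, i\}\, \dot{\varphi}_j\, \dot{\varphi}_h,$$

since $B$ is a geodesic, and of course retaining terms of the first
order only, we find

$$\frac{d^2 \xi^i}{d\sigma^2} - \frac{d\lambda}{d\sigma}\, \dot{\varphi}_i = -\sum_{j,h,k=1}^{n} \frac{\partial \{jh, i\}}{\partial x_k}\, \dot{\varphi}_j\, \dot{\varphi}_h\, \xi^k - 2 \sum_{j,h=1}^{n} \{jh, i\}\, \dot{\varphi}_j\, \frac{d\xi^h}{d\sigma}.$$

Next, for the sake of showing explicitly the contravariance of
the $\dot{\varphi}_i$ (parameters of the direction tangential to $B$), put

$$\dot{\varphi}_i(\sigma) = b^i \qquad (i = 1, 2, \ldots, n). \tag{52}$$

The equations just obtained (if we also change the indices $i$ and
$j$ into $r$ and $i$) become

$$\frac{d^2 \xi^r}{d\sigma^2} - \frac{d\lambda}{d\sigma}\, b^r = -\sum_{i,h,k=1}^{n} \frac{\partial \{ih, r\}}{\partial x_k}\, b^i\, b^h\, \xi^k - 2 \sum_{i,h=1}^{n} \{ih, r\}\, b^i\, \frac{d\xi^h}{d\sigma}. \tag{53}$$

We proceed to transform these equations for the sake of showing
their invariant structure with reference to any change of
co-ordinates. For this purpose we introduce Bianchi's conception
of the derivative vector of a vector ξ given as a function of the
points of a line (Chapter V, p. 139).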

If $B$ is the line, the contravariant components of the vector
$D\xi$, the derivative of ξ, are given by the equations

$$(D\xi)^r = \frac{d\xi^r}{d\sigma} + \sum_{i,h=1}^{n} \{ih, r\}\, b^i\, \xi^h \qquad (r = 1, 2, \ldots, n). \tag{54}$$

For co-ordinates which are Cartesian either rigorously, or else
locally along the line $B$, we have simply

$$(D\xi)^r = \frac{d\xi^r}{d\sigma}.$$

With the help of the vector $D\xi$ it is easy to give an explicit
expression for the elongation $\lambda$, without making any hypothesis as to
its order of magnitude, the equations (50) and (50'), which we
shall use, being exact.

On $g$ we have identically

$$\sum_{i,k=1}^{n} a_{ik}\, \dot{x}_i\, \dot{x}_k = 1.$$

If we multiply on both sides by $\left( \frac{ds}{d\sigma} \right)^2$, replace (in the $a_{ik}$'s)
the $x_i$ by their values (50) and the $\frac{ds}{d\sigma}\, \dot{x}_i$ by (50'), we find at
once (neglecting terms of the second order with respect to ξ)

$$\left( \frac{ds}{d\sigma} \right)^2 = 1 + 2 \sum_{i,k=1}^{n} a_{ik}\, \dot{\varphi}_i\, \frac{d\xi^k}{d\sigma} + \sum_{i,k,h=1}^{n} \frac{\partial a_{ik}}{\partial x_h}\, \dot{\varphi}_i\, \dot{\varphi}_k\, \xi^h.$$

We have, moreover, the well-known identities (Chapter V, p. 111)

$$\frac{\partial a_{ik}}{\partial x_h} = \sum_{j=1}^{n} \left[ a_{ij}\, \{kh, j\} + a_{kj}\, \{ih, j\} \right].$$

If in the expression for $\left( \frac{ds}{d\sigma} \right)^2$ we make suitable changes in the
indices of summation and take account of the identity just written down,
replacing also, as in (52), $\dot{\varphi}_i$ by $b^i$ (so that $\sum_i a_{ir}\, \dot{\varphi}_i$ becomes
the moment $b_r$), we find, on account of (54),

$$\left( \frac{ds}{d\sigma} \right)^2 = 1 + 2 \sum_{r=1}^{n} (D\xi)^r\, b_r.$$

The vector $D\xi$ is infinitely small at the same time as the $\xi^i$'s
and their derivatives. We have, therefore, by extracting the
root, neglecting terms of the second order, and attending to the
definition (51) of the elongation $\lambda$,

$$\lambda = \sum_{r=1}^{n} (D\xi)^r\, b_r, \tag{51'}$$

which shows its infinitesimal character.

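Naturally $D\xi$ admits in its turn a derivative vector $D^2\xi$. Its
contravariant components are defined, from (54), by

$$(D^2\xi)^r = \frac{d (D\xi)^r}{d\sigma} + \sum_{k,l=1}^{n} \{kl, r\}\, b^k\, (D\xi)^l.$$

Introducing on the right-hand side the expressions (54) for $(D\xi)^r$
and $(D\xi)^l$ (after making some literal changes in the indices), we
obtain

$$(D^2\xi)^r = \frac{d^2 \xi^r}{d\sigma^2} + \frac{d}{d\sigma} \sum_{i,k=1}^{n} \{ik, r\}\, b^i\, \xi^k + \sum_{h,l=1}^{n} \{lh, r\}\, b^h\, \frac{d\xi^l}{d\sigma} + \sum_{i,l,h,k=1}^{n} \{lh, r\}\, \{ik, l\}\, b^i\, b^h\, \xi^k, \tag{55}$$

the second members, like the first, constituting a contravariant
system. Bringing out explicitly the differentiation with respect
to $\sigma$, and making some changes of indices, we can write

$$(D^2\xi)^r = \frac{d^2 \xi^r}{d\sigma^2} + 2 \sum_{i,h=1}^{n} \{ih, r\}\, b^i\, \frac{d\xi^h}{d\sigma} + S^{(r)}, \tag{55'}$$

where we put for brevity

$$S^{(r)} = \sum_{i,h,k=1}^{n} \frac{\partial \{ik, r\}}{\partial x_h}\, b^i\, b^h\, \xi^k + \sum_{l,k=1}^{n} \{lk, r\}\, \frac{db^l}{d\sigma}\, \xi^k + \sum_{i,l,h,k=1}^{n} \{lh, r\}\, \{ik, l\}\, b^i\, b^h\, \xi^k.$$

We may note in passing that, in the auxiliary quantities $S^{(r)}$,
the index $r$ is purely ordinal: we have placed it above, but in
brackets, so as to avoid the suggestion that the $S^{(r)}$'s form
a contravariant system, which they do not.

For our purpose it is sufficient to replace in $S^{(r)}$ the derivatives
$\frac{db^l}{d\sigma}$ by their values

$$\frac{db^l}{d\sigma} = -\sum_{i,h=1}^{n} \{ih, l\}\, b^i\, b^h$$

as given by the equations for geodesics (Chapter V, p. 134), so that
we may write

$$S^{(r)} = \sum_{i,h,k=1}^{n} \frac{\partial \{ik, r\}}{\partial x_h}\, b^i\, b^h\, \xi^k + \sum_{i,l,h,k=1}^{n} \left[ \{lh, r\}\, \{ik, l\} - \{lk, r\}\, \{ih, l\} \right] b^i\, b^h\, \xi^k. \tag{56}$$

If then we add $S^{(r)}$ to the two members of equations (53),
attending to (55') and to the definition of Riemann's symbols (§ 2),
the equations take the invariant form

$$(D^2\xi)^r - \frac{d\lambda}{d\sigma}\, b^r = -\sum_{i,h,k=1}^{n} \{ir, hk\}\, b^i\, b^h\, \xi^k \qquad (r = 1, 2, \ldots, n). \tag{57}$$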
16. Geodesic deviation. Specification of the differential system. 
First integral. Linear relation in finite terms. 

The system (57), taken along with the value of $\lambda$ given in
(51'), contains $n + 1$ equations, with the same number of
unknowns; but it is easy to foresee, from the method of obtaining
it, that it cannot by itself determine completely all the unknowns;
there ought to remain an element of indeterminateness arising
from the arbitrary nature of the law of correspondence between
the points P of $B$ and M of $g$. More definitely, we can prove
that the definition (51') of $\lambda$, or rather the relation derived from
it by differentiation

$$\frac{d\lambda}{d\sigma} - \frac{d}{d\sigma} \sum_{r=1}^{n} (D\xi)^r\, b_r = 0, \tag{51''}$$

is a consequence of equations (57) themselves. To establish this,
note first that, for any vector v whose contravariant components
are $v^r$, we have

$$\frac{d}{d\sigma} \sum_{r=1}^{n} v^r\, b_r = \sum_{r=1}^{n} \frac{dv^r}{d\sigma}\, b_r + \sum_{r=1}^{n} v^r\, \frac{db_r}{d\sigma}.$$

But the derivatives $\frac{db_r}{d\sigma}$ of the moments of a geodesic $B$ satisfy
the equations (Chapter V, p. 134 and p. 139 (52'))

$$\frac{db_r}{d\sigma} = \sum_{i,h=1}^{n} \{ir, h\}\, b^i\, b_h,$$

which express, we may say, the autoparallelism of the geodesic
$B$ in terms of its moments $b_r$. The preceding identity, after
interchange of the indices $h$ and $r$ in the last sum, therefore takes the
form

$$\frac{d}{d\sigma} \sum_{r=1}^{n} v^r\, b_r = \sum_{r=1}^{n} (Dv)^r\, b_r. \tag{58}$$

In virtue of this equation, the first member of (51'') now becomes
(if we replace the vector v by $D\xi$)

$$\frac{d\lambda}{d\sigma} - \sum_{r=1}^{n} (D^2\xi)^r\, b_r.$$

That this expression vanishes we can easily prove from equation
(57), making use of the properties of Riemann's symbols, as
follows. Multiply both sides of (57) by $b_r$ and sum with respect
to $r$, noting that $\sum_r b^r\, b_r = 1$. The right-hand member can be
written

$$-\sum_{i,r,p,h,k=1}^{n} a_{rp}\, \{ir, hk\}\, b^p\, b^i\, b^h\, \xi^k.$$

We now sum with respect to $r$, thus changing the Riemann
symbol to the symbol of the first kind, and then make use of the
antisymmetry of $(ip, hk)$ in $i$ and $p$ (p. 179), from which it follows
that the sum is zero.

Equation (51') is therefore simply a particular integral (or
invariant relation) of the system (57); its role reduces to that
of fixing one of the constants of integration. As the system (57)
contains $n + 1$ unknowns, to make it determinate it is necessary
to associate with it some other condition, a circumstance easy
to understand from the geometrical point of view, since we have
still to fix the law of punctual correspondence between $g$ and the
base.
From a formal point of view the easiest way to complete
the system (57) is to cut out the unknown $\lambda$ by putting

$$\frac{d\lambda}{d\sigma} = 0.$$

The equations (57) are thus reduced to the normal form (Chapter
III, p. 36)

$$(D^2\xi)^r = -\sum_{i,h,k=1}^{n} \{ir, hk\}\, b^i\, b^h\, \xi^k \qquad (r = 1, 2, \ldots, n); \tag{I}$$

and we see that, on account of the identity (51''), the system (I)
admits the first integral

$$\sum_{r=1}^{n} (D\xi)^r\, b_r = \lambda = \text{constant}, \tag{II}$$

expressing the fact that there is a constant linear dilatation
when we pass from any arc of $B$ to the corresponding arc of $g$.
Since, on account of the identity (58), the first member of
the integral (II) is simply the derivative of $\sum_r \xi^r\, b_r$, it follows that
every solution of the differential system (I) gives also

$$\sum_{r=1}^{n} \xi^r\, b_r = \lambda \sigma + C, \tag{III}$$

where $C$ is a second constant.

If in particular we take $\lambda = 0$, we see that we can associate
with the differential system the relation

$$\sum_{r=1}^{n} \xi^r\, b_r = C.$$

This gives the translation into analysis of the obvious geometrical
fact that we can assign the correspondence between the points
M of $g$ and P of $B$ in such a way that the (infinitely small) vector
PM will have its orthogonal projection upon the tangent $t$ to
the base at the point P equal to a constant $C$. Such a law of
correspondence implies, in virtue of (II), that there is no
alteration of length ($\lambda = 0$) as between the arcs of $B$ and the
corresponding arcs of $g$. To particularize still further, if $C = 0$,
we arrive at the orthogonal law of correspondence (PM
perpendicular to $B$) considered in § 13.
It is scarcely necessary to add that, in order to substitute
other geometrical laws of correspondence, we have only to
associate with the system (57) the analytical translation of the
law chosen, instead of the law $\frac{d\lambda}{d\sigma} = 0$. For example, if we wish
PM to be inclined to $B$ at an angle $\theta$ (constant or a given
function of $\sigma$) the additional equation will be

$$\sum_{r=1}^{n} \xi^r\, b_r = \xi \cos\theta,$$

where

$$\xi = \left| \sqrt{\sum_{i,k=1}^{n} a_{ik}\, \xi^i\, \xi^k} \right|$$

represents the length of the vector ξ.

In a case like this, some slight supplementary discussion of
the complete system will be needed: its reduction to the normal
form, determination of the number of constants of integration,
&c.


17. Reduced form of the differential system (I) in terms of 
the co-ordinates y. 

We now return to the co-ordinates y, and fix definitely on 
the orthogonal law of correspondence between the base B and 
any geodesic in its neighbourhood. As we have just seen, such 
a correspondence is expressed analytically by the differential 
system ( 1 ), with the sj)ecifications A C -- 0 of the constants 
of integration connected with (11) and (III). 

As remarked in § 14, the co-ordinate of M is identical with 
that of P. Since the other co-ordinates (a = 1, 2, . . . n — 1) 
of P are 0, the variations 77* of the co-ordinates y arc respectively 

(a = 1, 2, ... n — 1), — 0, 


thus justifying the name of Cartesian components of the normal 
displacement or deviation PM = *»j which we give to the ly^’s. 


Moreover, the parameters = 


dVi 

dee 


of the base B vanish 


for i = 1 , 2 , ... n — 1 ; and 6" — 1. ChristofEel’s symbols also 
vanish along B, as well as their first derivatives with respect to 
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a (or, what comes to the same thing, to and consequently 

{DriY = 

aa 

Equations (I) thus become 

'S' "" (a =. 1. 2. . . . n - 1), (F) 

« - 1 

0 — 'Lp{nn, np}yp, 

where, in both sums, we have suppressed the term corresponding 
to the value n of the index, since every Riemann’s symbol 
which has its last indices equal vanishes (p. 177). 

The first group (I') (comprising $n-1$ linear equations of
the second order) defines the $n-1$ Cartesian components of the
(normal) deviation PM. The last equation reduces to an identity,
as may be seen as follows. Riemann's symbols of the second
kind are in all cases connected to those of the first kind by the
relations

$$(ij, hk) = \sum_{r=1}^{n} a_{jr}\,\{ir, hk\};$$

we have, moreover, for the symbols of the first kind (§ 4),

$$(ij, hk) = -(ji, hk).$$

In our case the coefficients $a_{jr}$ of $ds^2$ reduce, on B, to 0 (for $r \neq j$)
and to 1 (for $r = j$). We have therefore, on the base, equality
between symbols of the two kinds whose indices are the same;
and, in particular,

$$\{nn, n\beta\} = (nn, n\beta) = 0. \qquad \text{Q.E.D.}$$

Of course, the integral relations (II) and (III) are of no further
account, being now identities on account of the vanishing of the
$b^i$'s ($i = 1, 2, \dots, n-1$) and of the $n$th component $\eta^n$ of
the displacement PM.

18. Case of n = 2: formula of Jacobi.

For $n = 2$, that is to say for an ordinary surface, if B is
the geodesic base, $y_2$ the arc $\sigma$, and $y_1$ ($= y$) the normal
distance from M to B, the system (I') reduces to the single
equation

$$\frac{d^2 y}{d\sigma^2} = -\{21, 21\}\, y.$$

Now (§ 9) for any co-ordinates whatever, the Gaussian curvature
$K$ of a $V_2$ is expressed by the ratio

$$\frac{(12, 12)}{a} = \frac{(21, 21)}{a},$$

$a$ denoting the discriminant of the $ds^2$ of $V_2$.

For our co-ordinates, which are Cartesian along B, $a = 1$,
and Riemann's symbols of the second kind are (§ 17) the same as
their homologues of the first kind. The equation defining $y$ is
therefore none other than the equation of Jacobi (§ 13)

$$\frac{d^2 y}{d\sigma^2} + K y = 0.$$
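
As a numerical illustration of the equation of Jacobi, the following sketch integrates $y'' + K y = 0$ along the base with a classical fourth-order Runge-Kutta scheme. It is only an illustration under stated assumptions: the function name `integrate_jacobi`, the step count and the initial data $y(0) = 0$, $y'(0) = 1$ are choices made here and do not come from the text. For constant $K = 1$ (unit sphere) the deviation should follow $\sin\sigma$ and vanish again at $\sigma = \pi$; for $K = -1$ it should follow $\sinh\sigma$.

```python
import math


def integrate_jacobi(curvature, sigma_end, steps=2000, y0=0.0, dy0=1.0):
    """Integrate y'' + K(sigma) * y = 0 from 0 to sigma_end (classical RK4).

    `curvature` is a callable giving K along the base geodesic
    (an assumption of this sketch). Returns the deviation y(sigma_end).
    """
    h = sigma_end / steps
    sigma, y, v = 0.0, y0, dy0

    def rhs(s, y_val, v_val):
        # First-order system: y' = v, v' = -K(s) * y
        return v_val, -curvature(s) * y_val

    for _ in range(steps):
        k1y, k1v = rhs(sigma, y, v)
        k2y, k2v = rhs(sigma + h / 2, y + h / 2 * k1y, v + h / 2 * k1v)
        k3y, k3v = rhs(sigma + h / 2, y + h / 2 * k2y, v + h / 2 * k2v)
        k4y, k4v = rhs(sigma + h, y + h * k3y, v + h * k3v)
        y += h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        sigma += h
    return y
```

Calling `integrate_jacobi(lambda s: 1.0, math.pi)` returns a value close to zero, which reflects the fact that neighbouring geodesics issuing from a point of the unit sphere meet again at arc length $\pi$.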


CHAPTER VIII

Relations between two Different Metrics referred to
the Same Parameters; Manifolds of Constant Curvature

1. Differences between Christoffel’s symbols relative to two 
different metrics assigned to the same analytical manifold. 

We introduced in Chapter IV notions of tensor, covariance,
&c., relative to an analytical manifold, i.e. to the aggregate
of n variables $x_1, x_2, \dots, x_n$; we then, in the third part of the
following chapter, considered the metrical manifolds obtained
by associating with an analytical manifold a specified (but
arbitrary) positive and definite differential quadratic form.

There is clearly no reason against assigning in turn to the
same analytical manifold two distinct metrical determinations,
defined by the two quadratic forms¹

$$ds^2 = \sum_{i,k=1}^{n} a_{ik}\,dx_i\,dx_k, \qquad (1)$$

$$ds'^2 = \sum_{i,k=1}^{n} a'_{ik}\,dx_i\,dx_k. \qquad (1')$$

¹ As a geometrical interpretation, we can think of two distinct $V_n$'s whose
points are in one-to-one correspondence, so that a set of n values assigned to
$x_1, x_2, \dots, x_n$ can be represented either by a point P of one, or by the
corresponding point P' of the other. E.g. a map and the surface of the earth are
two $V_2$'s with different metrics (one is Euclidean, the other not), and to every
pair of values, $\varphi$ (for the latitude), $\lambda$ (for the longitude), correspond
one point on the map and one point on the earth.

From each of these forms we can obtain a set of Christoffel's
symbols, which we shall denote by

$$\{ih, r\} \quad\text{and}\quad \{ih, r\}'$$

respectively, and from these we can construct Riemann's symbols

$$\{ir, hk\} \quad\text{and}\quad \{ir, hk\}',$$

and the analogous symbols of the first kind. In this chapter we
propose to find the relations between the symbols relative to the
two metrics, and then to apply the results to geometrical
considerations.

We shall begin by forming the differences

$$\{ih, r\}' - \{ih, r\} = \rho^r_{ih}; \qquad (2)$$

we shall justify the positions of the indices on the right by showing
that the $\rho$'s constitute a tensor covariant with respect to i and
h and contravariant with respect to r.

To prove this, consider an arbitrary contravariant system $\xi^r$
whose elements are functions of position, and a system (also
arbitrary) of increments $dx_h$ of the independent variables. We
know (cf. Chapter V, p. 138) that the expressions of the type

$$\tau^r = d\xi^r + \sum_{i,h=1}^{n} \{ih, r\}\,\xi^i\,dx_h$$

constitute a contravariant system. The same result is of course
true for the analogous expressions corresponding to $ds'^2$,

$$\tau'^r = d\xi^r + \sum_{i,h=1}^{n} \{ih, r\}'\,\xi^i\,dx_h,$$

and also for the differences

$$\tau'^r - \tau^r = \sum_{i,h=1}^{n} \rho^r_{ih}\,\xi^i\,dx_h.$$

The fact that this expression is contravariant means that,
denoting by $u_r$ an arbitrary covariant simple system, the expression

$$\sum_{r=1}^{n} (\tau'^r - \tau^r)\,u_r = \sum_{i,h,r=1}^{n} \rho^r_{ih}\,\xi^i\,dx_h\,u_r$$

is an invariant; and if we examine the right-hand side of this
equality we see that its invariance requires that $\rho^r_{ih}$ should be
a tensor of the kind stated.

It will be convenient for subsequent purposes to introduce
also the associated covariant system

$$\rho_{ihj} = \sum_{r=1}^{n} a_{jr}\,\rho^r_{ih}. \qquad (2')$$


2. Differences between the covariant derivatives. 

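Given a generic tensor $A^{(h)}_{(i)}$, where as usual $(i)$ and $(h)$ denote
the aggregates of $m$ indices $i_1 \dots i_m$ and $\mu$ indices $h_1 \dots h_\mu$
respectively, we can consider its covariant derivatives with reference
to either the first or the second fundamental form, i.e. with
respect to either $ds^2$ or $ds'^2$. A generic element of the system
obtained by differentiation relative to the first form will be
denoted as usual by $A^{(h)}_{(i)|l}$; the analogous expression relative
to the second by $\bigl(A^{(h)}_{(i)}\bigr)'_{|l}$. We wish to evaluate the difference

$$\bigl(A^{(h)}_{(i)}\bigr)'_{|l} - A^{(h)}_{(i)|l}.$$

To find it we can use the explicit expression (p. 146,
formula (4) ) for the covariant derivatives of a generic mixed
system. These are linear in Christoffel's symbols, so that the
differences in question will be linear in the $\rho$'s; the expression
for them will be

$$\bigl(A^{(h)}_{(i)}\bigr)'_{|l} - A^{(h)}_{(i)|l}
= -\sum_{\alpha=1}^{m}\sum_{j=1}^{n} \rho^{j}_{i_\alpha l}\,
A^{(h)}_{i_1 \dots i_{\alpha-1}\, j\, i_{\alpha+1} \dots i_m}
+ \sum_{\beta=1}^{\mu}\sum_{j=1}^{n} \rho^{h_\beta}_{j l}\,
A^{h_1 \dots h_{\beta-1}\, j\, h_{\beta+1} \dots h_\mu}_{(i)}. \qquad (3)$$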


These general formulae can also be obtained, without using
any special memoria technica, from the original definition of
covariant differentiation, with respect to a given fundamental
form, of a generic tensor. It is therefore well to remind the reader
that, for an arbitrary displacement $dx_h$, we assigned to the
symbol $d$, when prefixed to a function of position, the usual
meaning of the infinitesimal increment (the differential) caused
by the displacement (cf. Chapter VI, p. 145); while for a generic
vector $\boldsymbol{\xi}$ and its contravariant components $\xi^r$ we assumed

$$d\xi^r = -\sum_{i,h=1}^{n} \{ih, r\}\,\xi^i\,dx_h, \qquad (4)$$

i.e. we defined the expressions $d\xi^r$ as the increments dependent
on parallelism. All of this referred to the metric (1), which was
then supposed fixed once and for all. We can of course follow
the same procedure taking (1') as the fundamental form; but to
avoid ambiguity it will be well to denote by $d'$ the increments
of the $\xi^r$'s due to the same displacement as before, so that we
shall have

$$d'\xi^r = -\sum_{i,h=1}^{n} \{ih, r\}'\,\xi^i\,dx_h. \qquad (4')$$

It will also be useful to introduce the operator

$$d^* = d' - d.$$

Since for functions of position $d'$ and $d$ have the same meaning,
we have

$$d^* f = 0$$

for any function of position $f$; for the contravariant components
of a generic vector $\boldsymbol{\xi}$ we have, subtracting (4) from (4'),

$$d^*\xi^r = -\sum_{i,h=1}^{n} \rho^r_{ih}\,\xi^i\,dx_h \quad (r = 1, 2, \dots, n). \qquad (5)$$

Similarly, given the covariant components $u_r$ of some other
vector $\mathbf{u}$, we find that

$$d^* u_r = \sum_{i,j=1}^{n} \rho^j_{ir}\,u_j\,dx_i. \qquad (5')$$

Now, in order to prove (3), we need only consider the invariant
multilinear form F whose coefficients are the elements of the
given tensor $A^{(h)}_{(i)}$; we know that the covariant derivatives
$A^{(h)}_{(i)|l}$ and $\bigl(A^{(h)}_{(i)}\bigr)'_{|l}$ are merely the coefficients of $dF$ and $d'F$ respectively.
If now we take the identity

$$d'F - dF = d^*F, \qquad (6)$$

and apply the operator $d^*$ on the right, using the property
$d^* f = 0$ for any function of position $f$, and (5), (5'), then
equating coefficients of like terms on each side we get
formula (3).

As a very simple application of this process, we shall determine
directly the values of the differences $(A_r)'_{|k} - A_{r|k}$ of
the derivatives of a covariant simple system $A_r$. We start
from the invariant form

$$F = \sum_{r=1}^{n} A_r\,\xi^r \qquad (7)$$

and consider the usual generic displacement, determined by the
increments $dx_k$ of the independent variables. We shall have,
with reference to $ds^2$,

$$dF = \sum_{r,k=1}^{n} A_{r|k}\,\xi^r\,dx_k, \qquad (8)$$

and with reference to $ds'^2$,

$$d'F = \sum_{r,k=1}^{n} (A_r)'_{|k}\,\xi^r\,dx_k. \qquad (8')$$

Further, applying the operator $d^*$ to F, and remembering
that $d^*A_r$ is zero, and that $d^*\xi^r$ is given by (5), we get

$$d^*F = -\sum_{r,i,h=1}^{n} A_r\,\rho^r_{ih}\,\xi^i\,dx_h.$$

The identity (6) therefore takes the form

$$\sum_{r,k=1}^{n} \bigl[(A_r)'_{|k} - A_{r|k}\bigr]\,\xi^r\,dx_k = -\sum_{r,i,h=1}^{n} A_r\,\rho^r_{ih}\,\xi^i\,dx_h.$$

Replacing on the right h by k, and interchanging i and r,
we get the typical term on the right also in a form involving
$\xi^r dx_k$; hence, equating coefficients, we have

$$(A_r)'_{|k} - A_{r|k} = -\sum_{i=1}^{n} \rho^i_{rk}\,A_i. \qquad (9)$$

This is the particular case of (3) which we shall require in
the next section.

3. Differences between Riemann's symbols.

We propose in this section to calculate the differences

$$R^r_{ihk} = \{ir, hk\}' - \{ir, hk\},$$

which, being differences of two like tensors, are by definition
tensors of the kind indicated. The calculations could be effected
directly on the expressions defining Riemann's symbols (Chapter
VII, p. 175), but the long formal expansion can be avoided by
the following method.

Let $A_r$ be any covariant simple system, and $\xi^r$ any
contravariant simple system, and consider the invariant form (7).
Applying to it the operator¹ $\Delta = \delta d - d\delta$ with reference to
$ds^2$ (cf. Chapter VII, p. 173), and remembering the fundamental
properties of this operator, we shall get

$$\Delta F = \sum_{r=1}^{n} A_r\,\Delta\xi^r,$$

or, expanding by formula (4) of Chapter VII,

$$\Delta F = -\sum_{i,r,h,k=1}^{n} \{ir, hk\}\,\xi^i A_r\,dx_h\,\delta x_k.$$

Similarly, with reference to $ds'^2$, we can introduce the operator
$\Delta' = \delta' d' - d'\delta'$, and write

$$\Delta' F = -\sum_{i,r,h,k=1}^{n} \{ir, hk\}'\,\xi^i A_r\,dx_h\,\delta x_k;$$

and subtracting the former equation from this we get

$$(\Delta' - \Delta)F = -\sum_{i,r,h,k=1}^{n} R^r_{ihk}\,\xi^i A_r\,dx_h\,\delta x_k. \qquad (10)$$

We shall now obtain by another method the expression for
the same quantity as a quadrilinear form in the quantities
$\xi^i, A_r, dx_h, \delta x_k$, and hence, equating corresponding coefficients of
the two forms, we shall find the expression for $R^r_{ihk}$.
Note first that

$$\Delta' - \Delta = (\delta' d' - d'\delta') - (\delta d - d\delta)
= (\delta' d' - \delta d) - (d'\delta' - d\delta).$$

Since the second expression in brackets is obtained from the first
by interchanging $d$ and $\delta$, we need only calculate the expression
for $\delta' d' - \delta d$. Further, in making this calculation we can ignore
all those terms which remain unchanged on interchanging $d$ and
$\delta$, since they will disappear when we take the difference; we shall
denote them collectively by $X(d, \delta)$.

¹ To avoid ambiguity we have here replaced the pair of symbols used in
Chapter VII to denote two distinct systems of increments by $d$ and $\delta$
respectively.

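Introducing the notation

$$d' - d = d^*, \qquad \delta' - \delta = \delta^*,$$

we have

$$\delta' d' = (\delta + \delta^*)\,d' = \delta d' + \delta^* d'
= \delta(d + d^*) + \delta^* d'
= \delta d + \delta d^* + \delta^* d',$$

so that

$$\delta' d' - \delta d = \delta d^* + \delta^* d'.$$

We therefore get

$$(\Delta' - \Delta)F = \delta d^*F + \delta^* d'F - (d\,\delta^*F + d^*\delta' F).$$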

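To calculate the first term we first apply the operator $d^*$
to the form F, remembering $d^*A_r = 0$ and (5). We get

$$d^*F = \sum_{r=1}^{n} A_r\,d^*\xi^r = -\sum_{r,i,h=1}^{n} \rho^r_{ih}\,A_r\,\xi^i\,dx_h.$$

Applying the operator $\delta$ to this form we get, from the definition
of covariant differentiation,

$$\delta d^*F = -\sum_{r,i,h,k=1}^{n} \rho^r_{ih|k}\,\xi^i A_r\,dx_h\,\delta x_k
- \sum_{r,i,h,k=1}^{n} \rho^r_{ih}\,A_{r|k}\,\xi^i\,dx_h\,\delta x_k.$$

Observing that the second sum can be written in the form

$$-\sum_{r,k=1}^{n} A_{r|k}\,\delta x_k \sum_{i,h=1}^{n} \rho^r_{ih}\,\xi^i\,dx_h,$$

we have, applying (5),

$$\delta d^*F = -\sum_{r,i,h,k=1}^{n} \rho^r_{ih|k}\,\xi^i A_r\,dx_h\,\delta x_k
+ \sum_{r,k=1}^{n} A_{r|k}\,d^*\xi^r\,\delta x_k. \qquad (11)$$

To calculate the second term we apply the operator
$\delta^*$ to (8'), and get

$$\delta^* d'F = \sum_{r,k=1}^{n} (A_r)'_{|k}\,\xi^r\,\delta^* dx_k + \sum_{r,k=1}^{n} (A_r)'_{|k}\,\delta^*\xi^r\,dx_k.$$

In the second sum, we can substitute for $(A_r)'_{|k}$ the expression
given by (9), and we get

$$\delta^* d'F = \sum_{r,k=1}^{n} (A_r)'_{|k}\,\xi^r\,\delta^* dx_k
+ \sum_{r,k=1}^{n} A_{r|k}\,\delta^*\xi^r\,dx_k
- \sum_{r,i,k=1}^{n} \rho^i_{rk}\,A_i\,\delta^*\xi^r\,dx_k. \qquad (12)$$

We must now add (11) and (12). In doing this, we notice
that the first sum in (12) is symmetrical in $d$ and $\delta$, since expanding
$\delta^* dx_k$ by (5) it can be written as

$$-\sum_{r,k,i,h=1}^{n} (A_r)'_{|k}\,\xi^r\,\rho^k_{ih}\,dx_i\,\delta x_h;$$

while the second sum in (12) and the second in (11) change one
into the other if we interchange $d$ and $\delta$, so that their sum is
symmetrical. There remains therefore

$$\delta d^*F + \delta^* d'F = X(d, \delta)
- \sum_{r,i,h,k=1}^{n} \rho^r_{ih|k}\,\xi^i A_r\,dx_h\,\delta x_k
- \sum_{r,i,k=1}^{n} \rho^i_{rk}\,A_i\,\delta^*\xi^r\,dx_k.$$

In the last term we substitute for $\delta^*\xi^r$ its explicit expression,
so that it becomes

$$\sum_{i,r,k,l,h=1}^{n} \rho^i_{rk}\,\rho^r_{lh}\,A_i\,\xi^l\,\delta x_h\,dx_k.$$

In order to be able to collect the terms in the two sums with
the common factor $\xi^i A_r\,dx_h\,\delta x_k$, we apply the substitution

$$\begin{pmatrix} r & l & i & h & k \\ l & i & r & k & h \end{pmatrix}$$

to the indices in the last sum, so getting

$$\delta d^*F + \delta^* d'F = X(d, \delta)
- \sum_{i,h,r,k=1}^{n} \Bigl[\rho^r_{ih|k} - \sum_{l=1}^{n} \rho^r_{lh}\,\rho^l_{ik}\Bigr]\,\xi^i A_r\,dx_h\,\delta x_k.$$

The expression obtained by interchanging $d$ and $\delta$ on the
right (remembering the definition of X) is

$$X(d, \delta) - \sum_{i,h,r,k=1}^{n} \Bigl[\rho^r_{ih|k} - \sum_{l=1}^{n} \rho^r_{lh}\,\rho^l_{ik}\Bigr]\,\xi^i A_r\,\delta x_h\,dx_k.$$

Interchanging h and k in the second of these, and subtracting,
we get finally

$$(\Delta' - \Delta)F = -\sum_{i,h,r,k=1}^{n}
\Bigl[\rho^r_{ih|k} - \rho^r_{ik|h} - \sum_{l=1}^{n} \bigl(\rho^r_{lh}\,\rho^l_{ik} - \rho^r_{lk}\,\rho^l_{ih}\bigr)\Bigr]\,
\xi^i A_r\,dx_h\,\delta x_k.$$

Comparing this with (10) we get the required formula:

$$R^r_{ihk} = \{ir, hk\}' - \{ir, hk\}
= \rho^r_{ih|k} - \rho^r_{ik|h} - \sum_{l=1}^{n} \bigl(\rho^r_{lh}\,\rho^l_{ik} - \rho^r_{lk}\,\rho^l_{ih}\bigr), \qquad (13)$$

which expresses the differences between Riemann's symbols of
the second kind in terms of the differences between Christoffel's
symbols. The analogy between this formula and that defining
Riemann's symbols (p. 175, formula (3) ) should be noted.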

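If we contract (13) by multiplying by $a_{jr}$ and summing with
respect to r, and then use the formula

$$\rho_{ihj|k} = \sum_{r=1}^{n} a_{jr}\,\rho^r_{ih|k},$$

obtained by covariant differentiation of (2') with respect to $ds^2$,
we get the covariant system

$$\sum_{r=1}^{n} a_{jr}\,R^r_{ihk} = \rho_{ihj|k} - \rho_{ikj|h}
- \sum_{l=1}^{n} \bigl(\rho_{lhj}\,\rho^l_{ik} - \rho_{lkj}\,\rho^l_{ih}\bigr). \qquad (14)$$

It is to be noted that this does not give the differences between
Riemann's symbols of the first kind. In fact, substituting on the
left the expression for $R^r_{ihk}$, (14) becomes

$$\sum_{r=1}^{n} a_{jr}\,\{ir, hk\}' - (ij, hk)
= \rho_{ihj|k} - \rho_{ikj|h} - \sum_{l=1}^{n} \bigl(\rho_{lhj}\,\rho^l_{ik} - \rho_{lkj}\,\rho^l_{ih}\bigr), \qquad (14')$$

and the first sum is not the same as $(ij, hk)'$, which would be
$\sum_{r} a'_{jr}\,\{ir, hk\}'$.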

4. Case of two metrics in conformal representation. 

We shall now apply the formula (14) to the case in which the
two fundamental forms (1) and (1') differ only by a factor. As
both forms are positive, this factor must also be positive, so
that we can denote it by $e^{2\tau}$; we shall therefore suppose that

$$ds'^2 = e^{2\tau}\,ds^2, \qquad (15)$$

or $ds' = e^{\tau}\,ds$.

The geometrical interpretation of this condition is quite
simple, namely, that the correspondence between the two manifolds
is such that infinitesimal segments are proportional, or
there is similarity of infinitesimal parts. It follows that the
angle between two curves (the angle between their tangents
at the point of intersection of two infinitesimal elements) is equal
to the angle between the corresponding curves; hence the name
of conformal representation.

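In order to calculate (14), we shall obtain in turn, first,
Christoffel's symbols of the first kind for the two forms, then
those of the second kind, from which we shall get the $\rho^r_{ih}$,
and lastly the $\rho_{ihj}$ and their derivatives.

We start from the relations equivalent to (15)

$$a'_{ij} = e^{2\tau}\,a_{ij} \quad (i, j = 1, 2, \dots, n) \qquad (15')$$

and shall calculate the symbol $[ih, l]'$. We get

$$[ih, l]' = \frac{1}{2}\left(\frac{\partial a'_{il}}{\partial x_h} + \frac{\partial a'_{hl}}{\partial x_i} - \frac{\partial a'_{ih}}{\partial x_l}\right)
= e^{2\tau}\bigl([ih, l] + a_{il}\,\tau_h + a_{hl}\,\tau_i - a_{ih}\,\tau_l\bigr),$$

where $\tau_h$ stands for $\dfrac{\partial\tau}{\partial x_h}$, &c.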

To construct the symbols of the second kind, or

$$\{ih, r\}' = \sum_{l=1}^{n} a'^{\,rl}\,[ih, l]',$$

we observe that the coefficients $a'^{\,rl}$ are, by definition, expressible
as the quotient of a determinant $A'_{rl}$ of order $n-1$ (the
complementary minor of $a'_{rl}$ in the determinant $\|a'_{rl}\|$) by a determinant
$a'$ (namely, $\|a'_{rl}\|$) of order n. Remembering (15'), we see that
in these determinants we can take out a factor $e^{2\tau}$ which is
common to every element, so that we can write

$$A'_{rl} = e^{2(n-1)\tau}\,A_{rl}, \qquad a' = e^{2n\tau}\,a,$$

where $A_{rl}$ and $a$ denote the determinants corresponding to $A'_{rl}$
and $a'$, but relative to the coefficients $a_{rl}$. We thus have

$$a'^{\,rl} = e^{-2\tau}\,a^{rl},$$

and therefore

$$\{ih, r\}' = \sum_{l=1}^{n} a^{rl}\bigl([ih, l] + a_{il}\,\tau_h + a_{hl}\,\tau_i - a_{ih}\,\tau_l\bigr)
= \{ih, r\} + \delta^r_i\,\tau_h + \delta^r_h\,\tau_i - a_{ih}\,\tau^r,$$

where $\tau^r = \sum_l a^{rl}\tau_l$, and the $\delta$'s as usual denote a factor which is
1 or 0 according as the indices are the same or different.

The difference $\{ih, r\}' - \{ih, r\}$ is therefore given by

$$\rho^r_{ih} = \delta^r_i\,\tau_h + \delta^r_h\,\tau_i - a_{ih}\,\tau^r. \qquad (16)$$

Multiplying this by $a_{jr}$ and summing with respect to r, we
get (using (2') )

$$\rho_{ihj} = a_{ij}\,\tau_h + a_{hj}\,\tau_i - a_{ih}\,\tau_j. \qquad (16')$$
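
The relation $\rho^r_{ih} = \delta^r_i\tau_h + \delta^r_h\tau_i - a_{ih}\tau^r$ lends itself to a direct numerical check. The sketch below, restricted to $n = 2$, computes Christoffel's symbols of both metrics by central differences and compares their difference with the closed expression. Everything named here is hypothetical test scaffolding rather than part of the text: the first form is taken to be the Euclidean plane in polar co-ordinates, and $\tau$ is an arbitrary smooth function.

```python
import math


def metric_derivatives(metric, x, eps=1e-5):
    """Central differences: result[l][i][k] approximates d a_ik / d x_l."""
    n = len(x)
    derivs = []
    for l in range(n):
        xp, xm = list(x), list(x)
        xp[l] += eps
        xm[l] -= eps
        gp, gm = metric(xp), metric(xm)
        derivs.append([[(gp[i][k] - gm[i][k]) / (2 * eps) for k in range(n)]
                       for i in range(n)])
    return derivs


def inverse_2x2(a):
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]


def christoffel(metric, x):
    """Symbols of the second kind for n = 2, indexed gamma[r][i][h] = {ih, r}."""
    inv = inverse_2x2(metric(x))
    d = metric_derivatives(metric, x)

    def first_kind(i, h, l):
        # [ih, l] = (1/2)(d a_il/dx_h + d a_hl/dx_i - d a_ih/dx_l)
        return 0.5 * (d[h][i][l] + d[i][h][l] - d[l][i][h])

    return [[[sum(inv[r][l] * first_kind(i, h, l) for l in range(2))
              for h in range(2)] for i in range(2)] for r in range(2)]


def rho_conformal(a, tau_grad, r, i, h):
    """Closed form: delta^r_i tau_h + delta^r_h tau_i - a_ih tau^r."""
    inv = inverse_2x2(a)
    tau_up = sum(inv[r][l] * tau_grad[l] for l in range(2))
    return ((1.0 if r == i else 0.0) * tau_grad[h]
            + (1.0 if r == h else 0.0) * tau_grad[i]
            - a[i][h] * tau_up)


# Hypothetical test data: polar co-ordinates (radius, angle), arbitrary tau.
def base_metric(x):
    return [[1.0, 0.0], [0.0, x[0] ** 2]]


def tau(x):
    return 0.3 * x[0] + math.sin(x[1])


def conformal_metric(x):
    factor = math.exp(2 * tau(x))
    return [[factor * entry for entry in row] for row in base_metric(x)]


def max_discrepancy(x):
    """Largest gap between the numerical difference of symbols and the closed form."""
    g0, g1 = christoffel(base_metric, x), christoffel(conformal_metric, x)
    grad = [0.3, math.cos(x[1])]  # exact gradient of tau above
    a = base_metric(x)
    return max(abs(g1[r][i][h] - g0[r][i][h] - rho_conformal(a, grad, r, i, h))
               for r in range(2) for i in range(2) for h in range(2))
```

The quantity returned by `max_discrepancy` should be of the order of the finite-difference error only, whatever point is chosen away from the origin of the polar system.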

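By covariant differentiation with respect to $ds^2$ we get

$$\rho_{ihj|k} = a_{ij}\,\tau_{hk} + a_{hj}\,\tau_{ik} - a_{ih}\,\tau_{jk};$$

subtracting from this formula the analogous one obtained by
interchanging h and k (so as to form the first part of (14) ), and
remembering that $\tau_{hk} = \tau_{kh}$, we find that

$$\rho_{ihj|k} - \rho_{ikj|h} = a_{hj}\,\tau_{ik} - a_{ih}\,\tau_{jk} - a_{kj}\,\tau_{ih} + a_{ik}\,\tau_{jh}. \qquad (17)$$

The second part of (14) can be constructed with the help of
(16) and (16'). We shall first calculate

$$\sum_{l=1}^{n} \rho_{lhj}\,\rho^l_{ik}
= \sum_{l=1}^{n} \bigl(a_{lj}\,\tau_h + a_{hj}\,\tau_l - a_{lh}\,\tau_j\bigr)\bigl(\delta^l_i\,\tau_k + \delta^l_k\,\tau_i - a_{ik}\,\tau^l\bigr)$$

$$= \underline{a_{ij}\,\tau_h\tau_k} + \underline{a_{kj}\,\tau_h\tau_i} - a_{ik}\,\tau_h\tau_j
+ \underline{a_{hj}\,\tau_i\tau_k} + a_{hj}\,\tau_k\tau_i - a_{hj}\,a_{ik}\,\Delta\tau
- a_{ih}\,\tau_j\tau_k - \underline{a_{kh}\,\tau_j\tau_i} + a_{ik}\,\tau_h\tau_j,$$

where $\Delta\tau = \sum_{l} \tau_l\,\tau^l$ (the first differential parameter). The third
term and the last term cancel out. Denoting by X the aggregate
of the four terms underlined, which are unchanged if we
interchange the indices h and k, we can write

$$\sum_{l=1}^{n} \rho_{lhj}\,\rho^l_{ik} = X + a_{hj}\,\tau_i\tau_k - a_{ih}\,\tau_j\tau_k - a_{hj}\,a_{ik}\,\Delta\tau.$$

We shall now subtract from this the formula obtained by
interchanging h and k. We shall get

$$\sum_{l=1}^{n} \bigl(\rho_{lhj}\,\rho^l_{ik} - \rho_{lkj}\,\rho^l_{ih}\bigr)
= (a_{ih}\,a_{jk} - a_{ik}\,a_{jh})\,\Delta\tau
+ a_{hj}\,\tau_i\tau_k - a_{kj}\,\tau_i\tau_h - a_{ih}\,\tau_j\tau_k + a_{ik}\,\tau_j\tau_h.$$

Using this and (17), we get the right-hand side of (14') in the
form

$$-a_{ih}\,(\tau_{jk} - \tau_j\tau_k) + a_{ik}\,(\tau_{jh} - \tau_j\tau_h) + a_{jh}\,(\tau_{ik} - \tau_i\tau_k)
- a_{jk}\,(\tau_{ih} - \tau_i\tau_h) - (a_{ih}\,a_{jk} - a_{ik}\,a_{jh})\,\Delta\tau.$$

The left-hand side of (14'), using (15'), can be written as

$$e^{-2\tau}\sum_{r=1}^{n} a'_{jr}\,\{ir, hk\}' - (ij, hk),$$

or $e^{-2\tau}\,(ij, hk)' - (ij, hk)$.

Finally, formula (14'), for two metrics in conformal
representation, can be written in the form

$$e^{-2\tau}\,(ij, hk)' - (ij, hk) = -a_{ih}\,(\tau_{jk} - \tau_j\tau_k) + a_{ik}\,(\tau_{jh} - \tau_j\tau_h)
+ a_{jh}\,(\tau_{ik} - \tau_i\tau_k) - a_{jk}\,(\tau_{ih} - \tau_i\tau_h) - (a_{ih}\,a_{jk} - a_{ik}\,a_{jh})\,\Delta\tau. \qquad (18)$$

This formula was found by Finzi, by another method, as early
as 1903.¹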

¹ Cf. "Le ipersuperficie a tre dimensioni che si possono rappresentare
conformemente sullo spazio euclideo", in Atti del R. Ist. Veneto, Vol. LXII, pp. 1049-1062.
The later researches of Finzi and Schouten on the manifolds of any number of
dimensions which can be conformally represented in a Euclidean space of the same
number of dimensions, should also be mentioned. Cf. Rend. della R. Acc. dei
Lincei, Vol. XXX (first half-year, 1922), pp. 8-12, and Vol. XXXI (first half-year,
1923), pp. 215-218, and Schouten's book cited at Chap. VII, p. 172. Cf. also
D. J. Struik: Grundzüge der mehrdimensionalen Differentialgeometrie (Berlin,
Springer, 1922), Ch. IV, § 13, p. 150, where Schouten's results are given with
bibliographical notes.

This formula can be given a simpler form by putting

$$u = e^{-\tau},$$

so that (15) becomes

$$ds'^2 = \frac{1}{u^2}\,ds^2.$$

We then have

$$u_i = -e^{-\tau}\,\tau_i, \qquad u_{ik} = -e^{-\tau}\,(\tau_{ik} - \tau_i\tau_k),$$

from which we get

$$\tau_i = -\frac{u_i}{u}, \qquad \tau_{ik} - \tau_i\tau_k = -\frac{u_{ik}}{u},$$

$$\Delta\tau = \sum_{i,k=1}^{n} a^{ik}\,\tau_i\tau_k = e^{2\tau}\sum_{i,k=1}^{n} a^{ik}\,u_i u_k = \frac{\Delta u}{u^2}.$$

Thus (18) becomes

$$u^2\,(ij, hk)' - (ij, hk)
= a_{ih}\,\frac{u_{jk}}{u} - a_{ik}\,\frac{u_{jh}}{u} - a_{jh}\,\frac{u_{ik}}{u} + a_{jk}\,\frac{u_{ih}}{u}
- (a_{ih}\,a_{jk} - a_{ik}\,a_{jh})\,\frac{\Delta u}{u^2}. \qquad (18')$$

At the end of this chapter we shall have occasion to point 
out an interesting geometrical application of this result. 
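
A small numerical illustration may help fix ideas. Taking $n = 2$ and a Euclidean first form in Cartesian co-ordinates (so that $a_{ik} = \delta_{ik}$ and $(12,12) = 0$), formula (18) with $i = h = 1$, $j = k = 2$ gives $e^{-2\tau}(12,12)' = -(\tau_{11} + \tau_{22})$, and since $a' = e^{4\tau}$ the Gaussian curvature of the second form is $K' = -e^{-2\tau}(\tau_{11} + \tau_{22})$. This specialization is worked out here and is not stated in the text. The sketch evaluates it with a five-point finite-difference Laplacian; the function names and the Mercator example are assumptions chosen for illustration.

```python
import math


def conformal_curvature_2d(tau, x, y, h=1e-3):
    """Curvature of e^{2 tau}(dx^2 + dy^2): K' = -e^{-2 tau} * Laplacian(tau).

    The Laplacian is approximated by the standard five-point stencil.
    """
    laplacian = (tau(x + h, y) + tau(x - h, y) + tau(x, y + h) + tau(x, y - h)
                 - 4.0 * tau(x, y)) / h ** 2
    return -math.exp(-2.0 * tau(x, y)) * laplacian


def mercator_tau(x, y):
    # Unit sphere in Mercator co-ordinates: ds'^2 = (dx^2 + dy^2) / cosh(y)^2,
    # i.e. e^{2 tau} = 1 / cosh(y)^2.
    return -math.log(math.cosh(y))
```

With `mercator_tau`, the computed curvature is close to 1 at every point, as it must be for a map of the unit sphere; a constant $\tau$ (a mere change of scale of the plane) gives zero.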

5. Isotropic manifolds.

Leaving aside for a moment this order of ideas, we propose
to study those $V_n$'s in which the Riemannian curvature, as
defined in Chapter VII, pp. 195-198, does not depend on the
section, but only (if it is variable) on the point. This is expressed
analytically by the fact that the expression for K given by (31)
of the preceding chapter is independent of the $u$'s and the $v$'s.
We shall see that these $V_n$'s, which we shall call isotropic, i.e.
with (locally) constant curvature, are characterized by a
particularly simple expression for Riemann's symbols.

We observe first that a fairly simple algebraic combination
of the coefficients $a_{ik}$ which possesses the fundamental properties
of Riemann's symbols, is the following:

$$b_{ij,hk} = \gamma\,(a_{ih}\,a_{jk} - a_{ik}\,a_{jh}),$$

where $\gamma$ is a priori any function whatever of position. Everything
reduces to proving that when these quantities are substituted
for the symbols $(ij, hk)$ in (31) of the preceding chapter, the
resulting value of K is independent of $u$ and $v$. In fact, making
the substitution, we have

$$K = \frac{\gamma}{\sin^2\alpha}\sum_{i,j,h,k=1}^{n} u^i v^j u^h v^k\,(a_{ih}\,a_{jk} - a_{ik}\,a_{jh})$$

$$= \frac{\gamma}{\sin^2\alpha}\Bigl[\sum_{i,h=1}^{n} a_{ih}\,u^i u^h \sum_{j,k=1}^{n} a_{jk}\,v^j v^k
- \sum_{i,k=1}^{n} a_{ik}\,u^i v^k \sum_{j,h=1}^{n} a_{jh}\,v^j u^h\Bigr],$$

and since

$$\sum_{i,h=1}^{n} a_{ih}\,u^i u^h = \sum_{j,k=1}^{n} a_{jk}\,v^j v^k = 1,$$

$$\sum_{i,k=1}^{n} a_{ik}\,u^i v^k = \sum_{j,h=1}^{n} a_{jh}\,v^j u^h = \cos\alpha,$$

it follows that

$$K = \frac{\gamma}{\sin^2\alpha}\,[1 - \cos^2\alpha] = \gamma.$$


Hence the Riemannian curvature of a $V_n$ whose Riemann's
symbols are the expressions $b_{ij,hk}$ is $\gamma$, and is therefore
independent of the section. But we can also show that this is the most
general expression of Riemann's symbols which will make $K = \gamma$.
In fact, if we put

$$(ij, hk) = b_{ij,hk} + B_{ij,hk},$$

where $b_{ij,hk}$ has the meaning assigned to it above, we shall show
that $B_{ij,hk} = 0$. To do this, we insert this expression for $(ij, hk)$
in (31); the right-hand side can then be broken up into two parts,
the first of which, containing the symbols $b_{ij,hk}$, is, as we have
seen, equal to $\gamma$, and the second, which is

$$\frac{1}{\sin^2\alpha}\sum_{i,j,h,k=1}^{n} B_{ij,hk}\,u^i v^j u^h v^k,$$

must vanish if we are to have $K = \gamma$.

The sum just written can be simplified if we observe that
since Riemann's symbols are antisymmetrical, the two terms

$$B_{ij,hk}\,u^i v^j u^h v^k, \qquad B_{ij,kh}\,u^i v^j u^k v^h$$

can be collected into a single term; putting

$$u^h v^k - u^k v^h = p^{hk},$$

the term becomes

$$B_{ij,hk}\,u^i v^j\,p^{hk}.$$

Thus the sum for all the permutations hk becomes merely a
sum for all the simple combinations (h, k) of unlike indices, since
it is useless to consider terms with a repeated index (k = h),
because $p^{hh} = 0$. We shall denote the sum extended only to
simple combinations of the indices by $\sum'_{hk}$ instead of $\sum_{hk}$. The
quadruple sum thus becomes

$$\sum_{i,j=1}^{n}\;\sideset{}{'}\sum_{hk} B_{ij,hk}\,u^i v^j\,p^{hk}.$$

Proceeding in the same way for the indices i, j, we get
ultimately

$$\sideset{}{'}\sum_{ij}\;\sideset{}{'}\sum_{hk} B_{ij,hk}\,p^{ij}\,p^{hk},$$

i.e. an expression bilinear in the $p$'s. Each summation will extend
to $m = \tfrac{1}{2}n(n-1)$ pairs; we shall number these (in any order)
from 1 to m, and put

$$p^{ij} = z_\beta, \qquad p^{hk} = z_\gamma, \qquad B_{ij,hk} = B_{(\beta)(\gamma)}$$

$$(i, j, h, k = 1, 2, \dots, n;\quad \beta, \gamma = 1, 2, \dots, m),$$

where $\beta$ is the ordinal number of the pair ij and $\gamma$ that of hk,
so that the sum can be written

$$\sum_{\beta,\gamma=1}^{m} B_{(\beta)(\gamma)}\,z_\beta\,z_\gamma.$$

It will now be clear that this expression cannot vanish for
arbitrary values of the $z$'s, unless all the B's are zero; which is
precisely what we wished to prove.

We may therefore conclude that for a $V_n$ whose curvature
is locally constant (i.e. independent of the section) and equal to
a given function of position K, Riemann's symbols are
necessarily given by the formula

$$(ij, hk) = K\,(a_{ih}\,a_{jk} - a_{ik}\,a_{jh}). \qquad (19)$$

Multiplying by $a^{jr}$ and summing with respect to j, we get
the expression for the symbols of the second kind:

$$\{ir, hk\} = K\,(a_{ih}\,\delta^r_k - a_{ik}\,\delta^r_h). \qquad (19')$$

The function K, however, cannot be arbitrarily assigned;
we shall show in the following section that for $n > 2$ it must be
a constant.
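
The independence of the section asserted by (19) can be illustrated numerically. The sketch below builds the symbols $K(a_{ih}a_{jk} - a_{ik}a_{jh})$ for an arbitrary positive definite matrix of coefficients and evaluates the curvature of several sections. The denominator used is $(u \cdot u)(v \cdot v) - (u \cdot v)^2$, which reduces to $\sin^2\alpha$ for unit vectors; this generalization to vectors of any length, the function names and the sample matrix are assumptions introduced here for convenience.

```python
def inner(a, u, v):
    """Scalar product sum_ik a_ik u^i v^k with respect to the coefficients a."""
    n = len(a)
    return sum(a[i][k] * u[i] * v[k] for i in range(n) for k in range(n))


def isotropic_symbol(a, gamma, i, j, h, k):
    """b_{ij,hk} = gamma * (a_ih a_jk - a_ik a_jh)."""
    return gamma * (a[i][h] * a[j][k] - a[i][k] * a[j][h])


def sectional_curvature(a, gamma, u, v):
    """Curvature of the section spanned by u and v when (ij,hk) = b_{ij,hk}."""
    n = len(a)
    numerator = sum(
        isotropic_symbol(a, gamma, i, j, h, k) * u[i] * v[j] * u[h] * v[k]
        for i in range(n) for j in range(n)
        for h in range(n) for k in range(n)
    )
    # Equals sin^2(alpha) when u and v are unit vectors.
    denominator = inner(a, u, u) * inner(a, v, v) - inner(a, u, v) ** 2
    return numerator / denominator
```

Whatever pair of independent vectors is supplied, the returned value agrees with the chosen `gamma` up to rounding.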


6. Schur's theorem.

This theorem states that if the curvature is locally constant,
it is also the same at all points. The case $n = 2$ is not considered,
as there is only one section at each point, so that we cannot
properly speak of locally constant curvature.

We shall therefore show that the K of formula (19) is constant,
or that

$$K_l = 0 \quad (l = 1, 2, \dots, n),$$

where $K_l$ represents a generic covariant derivative, identical
(cf. Chapter VI, p. 147) with the ordinary derivative.

To prove this, we take the covariant derivative of (19),
remembering Ricci's lemma. This gives

$$(ij, hk)_l = K_l\,(a_{ih}\,a_{jk} - a_{ik}\,a_{jh}).$$

Taking three distinct values for h, k, l (which is possible,
since $n > 2$), the other two relations obtained from this one
by cyclic permutation of h, k, l can be written in the form

$$(ij, kl)_h = K_h\,(a_{ik}\,a_{jl} - a_{il}\,a_{jk}),$$

$$(ij, lh)_k = K_k\,(a_{il}\,a_{jh} - a_{ih}\,a_{jl}).$$

Adding the terms on the left and on the right of these three
equations, and remembering Bianchi's identity (Chapter VII,
p. 183), we get

$$0 = K_l\,(a_{ih}\,a_{jk} - a_{ik}\,a_{jh}) + K_h\,(a_{ik}\,a_{jl} - a_{il}\,a_{jk})
+ K_k\,(a_{il}\,a_{jh} - a_{ih}\,a_{jl}). \qquad (20)$$

By varying i and j, we thus get $\tfrac{1}{2}n(n-1)$ relations, of which
we shall now make a suitable linear combination. Multiplying
(20) by $a^{ih}\,a^{jk}$ and summing with respect to i, j, we find that the
coefficients of $K_l$, $K_h$, $K_k$ are all of the type

$$\sum_{i,j=1}^{n} a^{ih}\,a^{jk}\,a_{i\alpha}\,a_{j\beta},$$

where $\alpha$, $\beta$ denote two of the indices h, k, l; or, making the two
summations in turn, of the type

$$\sum_{i=1}^{n} a^{ih}\,a_{i\alpha}\,\sum_{j=1}^{n} a^{jk}\,a_{j\beta} = \delta^h_\alpha\,\delta^k_\beta,$$

where the $\delta$'s as usual denote either 0 or 1. These quantities are
therefore always zero, unless we have simultaneously $\alpha = h$,
$\beta = k$ (which happens in the coefficient of the first term), in
which case the value is 1. Our linear combination of the equations
(20) thus reduces to

$$K_l = 0. \qquad \text{Q.E.D.}$$


7. Canonical form of $ds^2$ for a manifold of constant curvature.

Given a Euclidean space $S_n$ we propose to find, if it exists,
a manifold $V_n$ with constant curvature K, which can be
conformally represented on $S_n$, or in other words (cf. § 4) such that
its linear element is given by

$$ds'^2 = \frac{1}{u^2}\,ds^2,$$

where $ds$ is the element of $S_n$. We shall see that this is always
possible, and the solution of the problem will lead us to assign
two important forms for the $ds^2$ of a manifold with constant
curvature.

Keeping the notation of § 4, we shall have for Riemann's
symbols for the $V_n$ the expression (cf. formula (19) )

$$(ij, hk)' = K\,(a'_{ih}\,a'_{jk} - a'_{ik}\,a'_{jh}) = \frac{K}{u^4}\,(a_{ih}\,a_{jk} - a_{ik}\,a_{jh}),$$

and for $S_n$

$$(ij, hk) = 0,$$

since for a Euclidean space all Riemann's symbols are zero (cf.
pp. 173-178).

We must now substitute these values in the equations (18');
these constitute a system of differential equations the integration
of which will give the function $u$. Making the substitution, (18')
becomes

$$a_{ih}\,\frac{u_{jk}}{u} - a_{ik}\,\frac{u_{jh}}{u} - a_{jh}\,\frac{u_{ik}}{u} + a_{jk}\,\frac{u_{ih}}{u}
- (a_{ih}\,a_{jk} - a_{ik}\,a_{jh})\,\frac{\Delta u + K}{u^2} = 0
\quad (i, j, h, k = 1, 2, \dots, n). \qquad (21)$$

These equations can be satisfied by putting

$$u_{ik} = c\,a_{ik} \quad (i, k = 1, 2, \dots, n), \qquad (22)$$

where $c$ is a constant; in fact, substituting these values, they
take the form

$$\frac{1}{u^2}\,(a_{ih}\,a_{jk} - a_{ik}\,a_{jh})\,(2cu - K - \Delta u) = 0
\quad (i, j, h, k = 1, 2, \dots, n),$$

and in order that they may all be satisfied, we need only make
the common factor vanish, i.e. put

$$2cu - K - \Delta u = 0. \qquad (22')$$

We have therefore substituted for (21) the system composed
of the equations (22) and (22'), which holds whatever may be
the co-ordinates x. If then we suppose, as we always may, that
the $x$'s are orthogonal Cartesian co-ordinates of $S_n$, so that

$$a_{ik} = \delta_{ik}, \qquad \Delta u = \sum_{i=1}^{n}\left(\frac{\partial u}{\partial x_i}\right)^2, \qquad
u_{ik} = \frac{\partial^2 u}{\partial x_i\,\partial x_k},$$

our system will take the simpler form

$$\frac{\partial^2 u}{\partial x_i\,\partial x_k} = c\,\delta_{ik}, \qquad (23)$$

$$2cu - K - \sum_{i=1}^{n}\left(\frac{\partial u}{\partial x_i}\right)^2 = 0. \qquad (23')$$

We shall examine separately the two cases $c = 0$, $c \neq 0$.
If $c = 0$, the system becomes

$$\frac{\partial^2 u}{\partial x_i\,\partial x_k} = 0, \qquad (24)$$

$$\sum_{i=1}^{n}\left(\frac{\partial u}{\partial x_i}\right)^2 + K = 0, \qquad (24')$$

and from the second of these it follows that $K < 0$. Such a
solution is therefore possible only for manifolds of constant
negative curvature, since we do not consider the case $K = 0$,
which has no special interest, i.e. the case when $V_n$ is itself
Euclidean. Equations (24) then give, by an immediate integration,

$$u = \sum_{i=1}^{n} b_i\,x_i + b, \qquad (25)$$

where the $b_i$'s and $b$ are constants; and therefore, substituting
in (24'),

$$K + \sum_{i=1}^{n} b_i^2 = 0. \qquad (25')$$

This shows that the $b_i$'s are not all zero, and that therefore,
by applying an orthogonal substitution to the co-ordinates,¹
(25) can be put in the form

$$u = k\,x_n \quad (k \text{ constant}),$$

so that (25') becomes

$$K + k^2 = 0,$$

or $k = \sqrt{-K}$. We therefore have

$$u = \sqrt{-K}\;x_n,$$

and therefore

$$ds'^2 = \frac{dx_1^2 + dx_2^2 + \dots + dx_n^2}{-K\,x_n^2}. \qquad (26)$$

This is the canonical form of the line element of a manifold
of constant negative curvature. It was found by Beltrami² in 1868
by another method.

Another type of solution which holds for any value of K
whatever is obtained by supposing $c \neq 0$. (23) gives us the two
groups of equations

$$\frac{\partial^2 u}{\partial x_i\,\partial x_k} = 0, \quad i \neq k, \qquad (27)$$

$$\frac{\partial^2 u}{\partial x_i^2} = c. \qquad (27')$$

¹ The hypersurfaces $u$ = constant, i.e. $\sum_i b_i x_i$ = constant, are a set of parallel
hyperplanes; we need therefore only choose the axis $x_n$ in the direction
perpendicular to them in order that their equations may take the form $x_n$ = constant,
and therefore that $u = k\,x_n$.

² Opere matematiche, Vol. I, p. 419. Milan, Hoepli, 1902.

The first group has for its general integral

$$u = \sum_{i=1}^{n} X_i, \qquad (28)$$

where $X_i$ is a function of $x_i$ alone.

The second group gives

$$X_i'' = c,$$

where differentiation is denoted without ambiguity by dashes,
since the argument of $X_i$ is $x_i$ only.

From this, integrating once, we get

$$X_i' = c\,(x_i - x_i^0),$$

where the arbitrary constant of integration has been put in the
form $-c\,x_i^0$, using the hypothesis $c \neq 0$; and integrating a second
time

$$X_i = \frac{c}{2}\,(x_i - x_i^0)^2 + b_i,$$

where $b_i$ is a constant. Substituting from this in (28) and putting
$b = \sum_i b_i$, we get the following expression for $u$:

$$u = \frac{c}{2}\sum_{i=1}^{n} (x_i - x_i^0)^2 + b, \qquad (29)$$

containing $n + 2$ arbitrary constants.

We still have to consider (23'); substituting in it this value
of $u$, it becomes

$$2cb - K = 0 \qquad (23'')$$

and therefore merely establishes a relation between the two
constants $c$ and $b$.

We have therefore obtained a solution containing $n + 1$
arbitrary constants; we can choose these to satisfy specified
conditions at a generic but fixed point O of $S_n$. E.g. suppose we
wish to take the $x_i^0$'s in such a way that all the $u_i$ are zero at
the origin. We have from (29)

$$u_i = c\,(x_i - x_i^0);$$

hence every $x_i^0$ must vanish, so that (29) becomes (substituting
for $b$ from (23'') )

$$u = \frac{c}{2}\sum_{i=1}^{n} x_i^2 + \frac{K}{2c}. \qquad (30)$$

We can then determine $c$ so that at the origin $u = 1$; for
this we must have $c = \dfrac{K}{2}$, and we thus get finally

$$u = 1 + \frac{K}{4}\sum_{i=1}^{n} x_i^2. \qquad (30')$$

This value of $u$ makes $ds'^2$ take the form given by Riemann:

$$ds'^2 = \frac{dx_1^2 + dx_2^2 + \dots + dx_n^2}{\Bigl[1 + \dfrac{K}{4}\displaystyle\sum_{i=1}^{n} x_i^2\Bigr]^2}. \qquad (31)$$

We shall show farther on (§ 2, p. 246) that the $ds^2$ of any
$V_n$ whatever of constant curvature K can be put in the form
(31), and also if $K < 0$, in the form (26); this will justify the
choice of the term canonical forms for these expressions.
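
The two solutions just obtained can be checked against the system (23), (23') numerically. The following sketch estimates the first and second derivatives of a given $u$ by central differences and returns the largest residual of the two groups of equations. The helper names, the step size and the sample points are assumptions made for this illustration only.

```python
import math


def gradient(u, x, eps=1e-3):
    """Central-difference estimate of the first derivatives u_i."""
    grad = []
    for i in range(len(x)):
        xp, xm = list(x), list(x)
        xp[i] += eps
        xm[i] -= eps
        grad.append((u(xp) - u(xm)) / (2 * eps))
    return grad


def hessian(u, x, eps=1e-3):
    """Central-difference estimate of the second derivatives u_ik."""
    n = len(x)
    hess = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for k in range(n):
            xpp, xpm, xmp, xmm = list(x), list(x), list(x), list(x)
            xpp[i] += eps
            xpp[k] += eps
            xpm[i] += eps
            xpm[k] -= eps
            xmp[i] -= eps
            xmp[k] += eps
            xmm[i] -= eps
            xmm[k] -= eps
            hess[i][k] = (u(xpp) - u(xpm) - u(xmp) + u(xmm)) / (4 * eps ** 2)
    return hess


def canonical_residual(u, c, K, x):
    """Largest violation of u_ik = c * delta_ik and 2cu - K - sum(u_i^2) = 0."""
    n = len(x)
    hess = hessian(u, x)
    grad = gradient(u, x)
    worst = max(abs(hess[i][k] - (c if i == k else 0.0))
                for i in range(n) for k in range(n))
    scalar = abs(2 * c * u(x) - K - sum(g * g for g in grad))
    return max(worst, scalar)


def beltrami_u(K):
    # c = 0 solution, valid for K < 0: u = sqrt(-K) * x_n
    return lambda x: math.sqrt(-K) * x[-1]


def riemann_u(K):
    # c = K/2 solution: u = 1 + (K/4) * sum(x_i^2)
    return lambda x: 1.0 + K / 4.0 * sum(t * t for t in x)
```

For instance, `canonical_residual(beltrami_u(-2.0), 0.0, -2.0, [0.3, -0.4, 1.5])` and `canonical_residual(riemann_u(0.5), 0.25, 0.5, [0.3, -0.4, 1.5])` should both be negligibly small.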

Here we shall also prove the almost obvious property that a
hypersphere of radius R in Euclidean space $S_{n+1}$ of $n + 1$ dimensions
constitutes a $V_n$ of constant positive curvature $K = \dfrac{1}{R^2}$. To
do this we shall take $y_0, y_1, \dots, y_n$ to denote orthogonal
Cartesian co-ordinates in $S_{n+1}$, so that

$$ds^2 = \sum_{\nu=0}^{n} dy_\nu^2. \qquad (32)$$

Without loss of generality we can consider only the
hypersphere which has its centre at the origin, and is therefore
represented by the equation

$$\sum_{\nu=0}^{n} y_\nu^2 = R^2. \qquad (33)$$

We shall prove the assertion in the most direct way, by
expressing the $n + 1$ co-ordinates y of the points of the
hypersphere, connected by the relation (33), in terms of n suitable
curvilinear co-ordinates x, and showing that $ds^2$ takes the required
canonical form (31) when these parametric expressions of the
$y$'s in terms of the $x$'s are substituted in (32).

The parametric representation of the ij s referred to is an 
immediate generalization of that given for an ordinary spherical 
surface by stereographic projection. In this case (n = 2), if 
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we project from a point whose co-ordinates are ~ — R, 
y\ ~ Vi — ^ iipon the tangent plane at the diametrically 
opposite point, every point y^, y^) of the sphere projects into 
a point on the plane whose co-ordinates x^, x^ are connected 
with the y’a by the relations^ 

~ Jk (I- ')■<'-= (^= 1 . 2 ). ( 34 ) 

where 

U=l + ^p^ p^=k^J. (35) 


For any value of n, we shall ado]>t the same formulae, with 
the obvious modification that is to vary from 1 to n. This does 
in fact give a parametric representation of our hyperspherc; 
for squaring and adding the equations (34), and substituting 

n 

for a;„^its expression in terms of u as given by (30'), namely, 
1 


(w — 1), we get back to equation (33). 

K 


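We have then, differentiating,

$$dy_0 = -\frac{2}{\sqrt{K}}\,\frac{du}{u^2}, \qquad dy_\nu = \frac{dx_\nu}{u} - \frac{x_\nu\,du}{u^2} \quad (\nu = 1, 2, \dots, n).$$

Squaring and adding, and substituting $\frac{4}{K}(u - 1)$ and $\frac{4}{K}\,du$
for $\sum_1^n x_\nu^2$ and $2\sum_1^n x_\nu\,dx_\nu$ respectively on the right-hand side, we
get finally

$$ds^2 = \frac{1}{u^2}\sum_{\nu=1}^{n} dx_\nu^2,$$

which is the required result.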
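
As a numerical sanity check of the result just obtained (a minimal sketch, not part of the original text), the following Python fragment evaluates the parametric formulae (34) and (35) and verifies both that the image point satisfies (33) and that the line element (32), estimated by a central difference, agrees with the canonical form. The function names, the step h, and the sample values n = 3, K = 1/4 are assumptions made purely for illustration.

```python
import math

def stereographic(x, K):
    """Map x = (x_1..x_n) to (y_0..y_n) on the hypersphere of curvature K > 0, as in (34)-(35)."""
    rho2 = sum(xi * xi for xi in x)
    u = 1.0 + 0.25 * K * rho2
    y0 = (2.0 / u - 1.0) / math.sqrt(K)
    return [y0] + [xi / u for xi in x]

def sphere_residual(x, K):
    """Sum of y_nu^2 minus R^2 = 1/K; this should vanish by (33)."""
    return sum(y * y for y in stereographic(x, K)) - 1.0 / K

def metric_ratio(x, dx, K, h=1e-6):
    """Ratio of ds^2 from (32), by central difference, to the canonical sum dx^2 / u^2."""
    xp = [xi + h * di for xi, di in zip(x, dx)]
    xm = [xi - h * di for xi, di in zip(x, dx)]
    yp, ym = stereographic(xp, K), stereographic(xm, K)
    # Half the difference of the two images approximates the increment dy for a step h*dx.
    ds2 = sum(((p - m) / 2.0) ** 2 for p, m in zip(yp, ym))
    u = 1.0 + 0.25 * K * sum(xi * xi for xi in x)
    canonical = h * h * sum(di * di for di in dx) / (u * u)
    return ds2 / canonical

if __name__ == "__main__":
    point, step, K = [0.3, -1.2, 0.7], [1.0, 0.5, -2.0], 0.25
    print("residual of (33):", sphere_residual(point, K))
    print("ds^2 / canonical:", metric_ratio(point, step, K))
```

The ratio printed on the last line should be close to 1 for any point and any displacement, which is the content of the statement that (32) reduces to the canonical form.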


¹ These relations can easily be shown to be the same as those ordinarily used if
we replace ρ, the radius vector of the projection, by the colatitude $\vartheta$ of the point
on the sphere. As by definition $\sqrt{K}\,y_0 = \cos\vartheta$, it follows that $1/u = \cos^2\tfrac{1}{2}\vartheta$,
$\rho = 2R\tan\tfrac{1}{2}\vartheta$.


CHAPTER IX

Differential Quadratic Forms of Class Zero
and Class One

1. Forms of class zero (or Euclidean forms). 

In Chapter V, p. 123, we defined the class of a given $V_n$ (or
of the quadratic form $ds^2$ which characterizes it) as the number
N − n, where N is the minimum number of dimensions of a
Euclidean space in which the $V_n$ can be immersed.

We shall consequently say that a quadratic differential form

$$ds^2 = \sum_{i,k=1}^{n} a_{ik}\,dx_i\,dx_k \tag{1}$$

is of class zero (or is Euclidean) if it is possible to substitute for
the n variables x a set of n variables y (since N = n), connected
with the x’s by the relations

$$y_\nu = y_\nu(x_1, x_2, \dots, x_n) \quad (\nu = 1, 2, \dots, n), \tag{2}$$

and such that (1) assumes the Cartesian form

$$ds^2 = \sum_{\nu=1}^{n} dy_\nu^2. \tag{1'}$$

Given (1) we wish to find a criterion which will enable us to
recognize whether such a transformation is possible. We shall
show that it is sufficient to construct Riemann’s symbols relative
to (1), and to determine whether they vanish identically or not.
We have already seen (Chapter VII, p. 178) that this condition
is necessary; we wish to prove that if, inversely, all Riemann’s
symbols relative to (1) are identically zero, then (1) can be
transformed into (1'); or in other words that the n functions (2) can
be so determined as to satisfy the ½n(n + 1) equations

$$a_{ik} = \sum_{\nu=1}^{n} \frac{\partial y_\nu}{\partial x_i}\,\frac{\partial y_\nu}{\partial x_k} \quad (i, k = 1, 2, \dots, n) \tag{3}$$

(cf. Chapter V, p. 122, formula (35)).

By covariant differentiation of (3), writing $y_{\nu|i}$ for the derivatives
$\partial y_\nu/\partial x_i$, we get

$$0 = \sum_{\nu=1}^{n} \left(y_{\nu|il}\,y_{\nu|k} + y_{\nu|i}\,y_{\nu|kl}\right).$$

By cyclic permutation of the indices i, k, l, we get from this
the two further equations

$$0 = \sum_{\nu=1}^{n} \left(y_{\nu|ki}\,y_{\nu|l} + y_{\nu|k}\,y_{\nu|li}\right),$$

$$0 = \sum_{\nu=1}^{n} \left(y_{\nu|lk}\,y_{\nu|i} + y_{\nu|l}\,y_{\nu|ik}\right).$$

Now add the last two of these equations and subtract the
first. From the commutation rule (§ 6, p. 184), combined with
the vanishing of Riemann’s symbols, it follows that the second
derivatives are permutable, so that we get

$$\sum_{\nu=1}^{n} y_{\nu|ik}\,y_{\nu|l} = 0.$$

Keeping i and k fixed, and making l vary from 1 to n, this
formula gives us n linear homogeneous equations in the n
unknowns $y_{\nu|ik}$ (ν = 1, 2, ..., n). The determinant of the system
is certainly not zero, since it is composed of the terms $y_{\nu|l}$,
i.e. is the functional determinant of the transformation (2); we
therefore conclude that

$$y_{\nu|ik} = 0 \quad (\nu, i, k = 1, 2, \dots, n). \tag{5}$$

These equations, which we have deduced from (3), can be
put in the form

$$\frac{\partial y_{\nu|i}}{\partial x_k} = f_{\nu|ik}(x \mid y_{\nu|1}, \dots, y_{\nu|n}), \tag{5'}$$

in which we are concerned only to the extent of observing that
the right-hand side is a known function of position and of the
terms $y_{\nu|i}$.

It is now easy to see that the problem is reduced to that of a
mixed system of total differential equations and equations in
finite terms which we have already considered in § 8, p. 29.

In fact, considering as unknowns the n quantities $y_\nu$ and the
n² quantities $y_{\nu|i}$, we can collect together the equations (4) and
(5') into a system of total differential equations

$$dy_\nu = \sum_{i=1}^{n} y_{\nu|i}\,dx_i, \qquad dy_{\nu|i} = \sum_{k=1}^{n} f_{\nu|ik}(x \mid y_{\nu|1}, \dots, y_{\nu|n})\,dx_k \quad (\nu, i = 1, 2, \dots, n), \tag{S}$$

while the group (3) constitutes ½n(n + 1) relations in finite
terms between the n(n + 1) unknowns.

The conditions for complete integrability, by the usual rule,
are as follows:

(a) $\dfrac{\partial y_{\nu|k}}{\partial x_h} = \dfrac{\partial y_{\nu|h}}{\partial x_k}$,

(b) $\dfrac{\partial f_{\nu|ik}}{\partial x_h} = \dfrac{\partial f_{\nu|ih}}{\partial x_k}$ (ν, i, h, k = 1, 2, ..., n);

(c) the equations obtained by differentiating the equations (3)
must be identically satisfied in virtue of the equations (S).

Introducing the covariant derivatives and once more applying
the commutation rule for the second derivatives, the conditions
(a) can be written in the form

$$y_{\nu|kh} - y_{\nu|hk} = \text{linear combinations of Riemann's symbols},$$

and it will then at once be seen that they are satisfied identically,
since the left-hand side vanishes in virtue of (5), and the right-hand
side also vanishes, since by hypothesis Riemann’s symbols
are zero.

A similar argument holds for the conditions (b), which are
equivalent to

$$y_{\nu|ikh} - y_{\nu|ihk} = \text{linear combinations of Riemann's symbols}.$$

Lastly, taking the covariant derivatives of (3), we find the
conditions (c) in the form

$$0 = \sum_{\nu=1}^{n} \left(y_{\nu|il}\,y_{\nu|k} + y_{\nu|i}\,y_{\nu|kl}\right),$$

and it can at once be verified that all these are satisfied, in virtue
of Ricci’s lemma and the equations (5).

The mixed system is therefore complete, and it will be possible
to find the functions (2), which will contain ½n(n + 1) arbitrary
constants, this being the difference between the number of
unknowns and the number of equations in finite terms. In
geometrical terms, if the manifold is Euclidean there are in it
$\infty^{\frac{1}{2}n(n+1)}$ (orthogonal) Cartesian systems. If we can find a particular
solution $\eta_1, \eta_2, \dots, \eta_n$, we can get the most general solution by
a substitution of the type

$$y_i = \sum_{j=1}^{n} \alpha_{ij}\,\eta_j + c_i \quad (i = 1, 2, \dots, n), \tag{6}$$

where the α’s are the coefficients of an orthogonal substitution,
i.e. are connected by the ½n(n + 1) equations

$$\sum_{i=1}^{n} \alpha_{ij}\,\alpha_{ih} = \delta_j^h \quad (j, h = 1, 2, \dots, n), \tag{7}$$

while the c’s are n completely arbitrary constants.

This can be immediately proved from the characteristic
properties of orthogonal substitutions. In fact, from equations (6),
differentiating, squaring, and adding, we get

$$dy_i = \sum_{j=1}^{n} \alpha_{ij}\,d\eta_j, \qquad (dy_i)^2 = \sum_{j=1}^{n}\sum_{h=1}^{n} \alpha_{ij}\,\alpha_{ih}\,d\eta_j\,d\eta_h;$$

summing the last of these with respect to i and using (7), we get

$$\sum_{i=1}^{n} (dy_i)^2 = \sum_{j,h=1}^{n} \delta_j^h\,d\eta_j\,d\eta_h = \sum_{j=1}^{n} d\eta_j^2.$$

The hypothesis that the η’s are a particular solution of the
system is expressed algebraically by the equation $\sum_1^n d\eta_j^2 = ds^2$;
hence we can write

$$\sum_{i=1}^{n} (dy_i)^2 = ds^2,$$

which proves that the y’s also constitute a solution. An easy
calculation shows that the number of independent constants in
(6) is ½n(n + 1), and hence the solution so obtained is the most
general.

It is obvious that the equations (6) are a generalization
of the formulae for changing the co-ordinate axes in ordinary
analytical geometry.
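
A simple illustration, which is not in the original text and is offered only as a sketch, is the Euclidean plane referred to polar co-ordinates. The choice $x_1 = r$, $x_2 = \theta$ and the constants $\theta_0, c_1, c_2$ are assumptions of this example; it shows a form of class zero, a particular solution of (3), and the ½n(n + 1) = 3 arbitrary constants of the general solution (6).

```latex
% Illustrative example: polar co-ordinates (r, \theta) in the plane, n = 2.
\begin{align*}
ds^2 &= dr^2 + r^2\,d\theta^2, \qquad a_{11} = 1,\quad a_{22} = r^2,\quad a_{12} = 0,\\
\eta_1 &= r\cos\theta, \qquad \eta_2 = r\sin\theta,\\
d\eta_1^2 + d\eta_2^2
  &= (\cos\theta\,dr - r\sin\theta\,d\theta)^2 + (\sin\theta\,dr + r\cos\theta\,d\theta)^2
   = dr^2 + r^2\,d\theta^2,\\
y_1 &= \eta_1\cos\theta_0 - \eta_2\sin\theta_0 + c_1, \qquad
y_2 = \eta_1\sin\theta_0 + \eta_2\cos\theta_0 + c_2 .
\end{align*}
```

The last line is the substitution (6) for a rotation through $\theta_0$ followed by a translation, so the three constants $\theta_0, c_1, c_2$ account for the whole continuous family of Cartesian systems in the plane.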


2. Conformal representation of a manifold of constant curvature
on a Euclidean space. Mutual applicability of all $V_n$’s with the
same constant curvature.


In the preceding chapter (p. 236) we solved the following problem:
given a Euclidean space $S_n$, to find a manifold $V_n$ of given
constant curvature which can be conformally represented on $S_n$.
We now propose to prove that conversely, given a manifold $V_n$
of constant curvature, it is always possible to represent it
conformally on a Euclidean space $S_n$. In other words, if $ds^2$ is the
line element of a $V_n$ of constant curvature K, we wish to prove
that a suitable function U can be so chosen that

$$ds'^2 = \frac{ds^2}{U^2} = \frac{1}{U^2}\sum_{i,k=1}^{n} a_{ik}\,dx_i\,dx_k$$

is Euclidean.

The necessary and sufficient condition for this is that the
equations (18') of Chapter VIII, p. 232, should all be satisfied
by putting

$$(ij, hk) = K\,(a_{ih}a_{jk} - a_{ik}a_{jh})$$

and writing U instead of u. U must therefore satisfy the
$\frac{1}{12}n^2(n^2 - 1)$ equations

$$\left(\frac{\Delta U}{U^2} - K\right)(a_{ih}a_{jk} - a_{ik}a_{jh}) = \frac{1}{U}\left(a_{ih}U_{jk} + a_{jk}U_{ih} - a_{ik}U_{jh} - a_{jh}U_{ik}\right).$$

Putting

$$U_{ik} = (\alpha U + \beta)\,a_{ik}, \tag{8}$$

where α and β are two constants, and following the same method
as that used in § 7 of Chapter VIII, p. 236, we see that these
equations are satisfied provided ultimately

$$\Delta U = (K + 2\alpha)\,U^2 + 2\beta U. \tag{9}$$

If we consider the equations (8) as defining all the derivatives
of the n quantities $U_i$, then together with the identity

$$dU = \sum_{i=1}^{n} U_i\,dx_i \tag{8'}$$

they constitute a total differential system in the n + 1 functions
$U_i$, U; the equation in finite terms (9) is to be associated with
it. It is easy to verify that we need only take α = −K in order
that this mixed system may be completely integrable (cf. Chapter
II, p. 29).

In fact, the conditions of integrability of the equations (8)
are expressed by the commutation formulae (§ 6, p. 184)

$$U_{i|kl} - U_{i|lk} = -\sum_{j=1}^{n} (ij, kl)\,U^j, \tag{C}$$

and those of the equations (8') by

$$U_{ik} = U_{ki}.$$

These latter conditions are at once satisfied, on account of
equations (8). The left-hand side of (C), also by (8), reduces
to

$$\alpha\,(a_{ik}U_l - a_{il}U_k),$$

and the right-hand side, using the expression (19') of the
preceding chapter for Riemann’s symbols for manifolds of constant
curvature, becomes

$$-K\,(a_{ik}U_l - a_{il}U_k).$$

The equations (C) therefore reduce to identities provided, as
stated above, we take α = −K.

Lastly, there is the equation in finite terms (9); putting
α = −K, this becomes

$$\sum_{j=1}^{n} U_j\,U^j = -KU^2 + 2\beta U. \tag{9'}$$

Differentiating this, using formula (16') of p. 162, and taking
out the factor 2, we get the conditions

$$\sum_{j=1}^{n} U^j\,U_{ji} = -KUU_i + \beta U_i,$$

which are also identically satisfied in virtue of (8).

Remark. Having thus seen that the system is completely
integrable, we know (§ 8, pp. 29-33) that the solution contains
n arbitrary constants which we can choose in such a way that
at a specified (but perfectly general) point O of the manifold the
n functions $U_i$ take values arbitrarily fixed in advance. Further,
the constant β is still at our disposal.

We get a first class of solutions if we take β = 0, which
makes (8) into

$$U_{ik} = -KU\,a_{ik}.$$

The hypothesis β = 0 is therefore admissible in the real
field only when K < 0; in fact, for β = 0 the equation (9')
reduces to

$$\Delta U = -KU^2.$$

In the real field the left-hand side is always essentially positive,
excluding the case when the function U is a pure constant, or
in other words (on account of equations (8), which now reduce
to $U_{ik} = -KU\,a_{ik}$) retaining the condition K ≠ 0. Since
the right-hand side has the opposite sign to K, it follows that
the equality is possible only if K < 0.

In order to have a generally valid solution, we must suppose
β ≠ 0. We shall then choose β and the n other constants so
that

$$U_i = 0, \qquad U = 1 \quad (i = 1, 2, \dots, n)$$

at the point O, so that from (9') we see that β = K/2, and U will
be completely determined.

With the notation of the present problem (i.e. using dashes to
denote quantities relative to the Euclidean space) we proved in
Chapter VIII, § 7, p. 236, that if a factor u exists such that the
manifold for which

$$ds^2 = \frac{ds'^2}{u^2}$$

has constant curvature K, and if the conditions u = 1, $u_i = 0$
are satisfied at a specified point O (which may always be supposed
taken as origin of Cartesian co-ordinates), then the expression
for u is

$$u = 1 + \frac{K}{4}\sum_{i=1}^{n} x_i^2.$$

Further, we have now found that the quantity 1/U satisfies
all these conditions (in fact 1/U = 1 at O, and $\partial(1/U)/\partial x_i = 0$
at O), and therefore we must have

$$\frac{1}{U} = 1 + \frac{K}{4}\sum_{i=1}^{n} x_i^2.$$

An extremely important corollary can be deduced from the
foregoing results. Given two n-dimensional manifolds with the
same constant curvature K, both their $ds^2$, as we have seen,
can be reduced by suitable changes of variables to the same
canonical form

$$ds^2 = \frac{1}{u^2}\sum_{i=1}^{n} dx_i^2, \qquad\text{where}\qquad u = 1 + \frac{K}{4}\sum_{i=1}^{n} x_i^2.$$

It is therefore possible by a change of variables to transform
one form into the other; or in other words, if two manifolds of
the same number of dimensions satisfy the single condition of
having the same constant curvature, then either can be conformally
represented on the other.


3. General remarks on hypersurfaces in Euclidean space.
Second fundamental form.

Let $S_{n+1}$ be a Euclidean space and $y_1, y_2, \dots, y_{n+1}$ a system
of Cartesian co-ordinates in it, so that

$$ds^2 = \sum_{\nu=1}^{n+1} dy_\nu^2.$$

Consider a hypersurface $V_n$ (frequently called merely a
“surface” when there is no danger of ambiguity) immersed in
$S_{n+1}$, defined by the parametric equations

$$y_\nu = y_\nu(x_1, x_2, \dots, x_n) \quad (\nu = 1, 2, \dots, n + 1). \tag{10}$$

As usual, the functional matrix of these equations must have
n as its characteristic (cf. p. 87).

As an obvious extension of the ordinary case (n = 2) we shall
first define the direction of $S_{n+1}$ which is normal to $V_n$ at any
given point P.

Let $\alpha_\nu$ (ν = 1, 2, ..., n + 1) denote the cosines of the direction
we are in search of, relative to the axes y (i.e. the parameters
or moments, which are indistinguishable in a Euclidean space).
These cosines will be connected by the usual quadratic identity

$$\sum_{\nu=1}^{n+1} \alpha_\nu^2 = 1. \tag{11}$$

The geometrical property which we have to express is that
the direction whose cosines are $\alpha_\nu$ is perpendicular to any tangent
to $V_n$ at P, or, which is the same thing, to any elementary
displacement dP which is a tangent to $V_n$, and therefore (neglecting
infinitesimals of higher order than the first) does not move
outside the surface. For every such displacement the equations
(10) must still be satisfied, but the increments $dx_i$ of the x’s
will be otherwise arbitrary. If $dy_\nu$ denotes the corresponding
increments of the Cartesian co-ordinates $y_\nu$, the α’s must satisfy
the equation

$$\sum_{\nu=1}^{n+1} \alpha_\nu\,dy_\nu = 0 \tag{11'}$$

for every system of dy’s given by (10), i.e. by

$$dy_\nu = \sum_{i=1}^{n} y_{\nu|i}\,dx_i, \tag{12}$$

with the dx’s arbitrary.

Substituting in (11') we have

$$\sum_{i=1}^{n} dx_i \sum_{\nu=1}^{n+1} \alpha_\nu\,y_{\nu|i} = 0;$$

and since the dx’s are arbitrary this means that the α’s must
satisfy the n equations

$$\sum_{\nu=1}^{n+1} \alpha_\nu\,y_{\nu|i} = 0 \quad (i = 1, 2, \dots, n). \tag{12'}$$

These equations, together with (11), determine the α’s except
as to sign. The ambiguity of the sign is natural, as we are dealing
with a direction and have made no hypothesis as to its sense.
In what follows we shall suppose the sense fixed in advance as
may be most convenient.

We know that the metric of $V_n$ is defined by the quadratic
form

$$\phi = ds^2 = \sum_{i,k=1}^{n} a_{ik}\,dx_i\,dx_k.$$

In addition to this it is useful to consider a second differential
quadratic form which differs from the first in that it depends
on the configuration of $V_n$ in $S_{n+1}$ (or in other words is not an
intrinsic element), or rather completely determines this
configuration.

To find this function we suppose an infinitesimal segment of
constant length ε measured off along the positive sense (as defined
in advance) of the normal at every point of the given $V_n$. The
extremities of all these segments will lie on a hypersurface $V'_n$,
which is said to be parallel to $V_n$; there is an obvious one-to-one
correspondence between points on one and points on the other.
We wish to consider two infinitely near points of $V_n$, and to
compare their distance apart ds with the distance ds' of the two
corresponding points of $V'_n$.

If the co-ordinates of a generic point of $V_n$ are $y_\nu$ (ν = 1,
2, ..., n + 1), those of the corresponding point of $V'_n$ will be

$$y'_\nu = y_\nu + \epsilon\,\alpha_\nu.$$

From this, differentiating and remembering that ε is a constant,
we get

$$dy'_\nu = dy_\nu + \epsilon\,d\alpha_\nu.$$


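Squaring and adding, we get $ds'^2$; denoting it by φ', we have

$$\phi' = \sum_{\nu=1}^{n+1} \left(dy_\nu^2 + \epsilon^2\,d\alpha_\nu^2 + 2\epsilon\,dy_\nu\,d\alpha_\nu\right).$$

Now $\sum_1^{n+1} dy_\nu^2 = \phi$, and since ε is infinitesimal it follows that
$\epsilon^2\,d\alpha_\nu^2$ is negligible compared with the other terms; hence

$$\phi' = \phi - 2\epsilon\psi, \tag{13}$$

where we have put

$$\psi = -\sum_{\nu=1}^{n+1} dy_\nu\,d\alpha_\nu. \tag{14}$$

Formula (13) gives the increment of the first fundamental
form φ in passing from the given $V_n$ to an infinitely near parallel
surface; this increment is expressed in terms of the quantity
ψ, which, as we shall now see, is a quadratic form in the dx’s.
To show this, we note that

$$dy_\nu = \sum_{i=1}^{n} y_{\nu|i}\,dx_i, \qquad d\alpha_\nu = \sum_{k=1}^{n} \alpha_{\nu|k}\,dx_k;$$

hence, substituting in (14), and putting

$$b_{ik} = -\tfrac{1}{2}\sum_{\nu=1}^{n+1} \left(y_{\nu|i}\,\alpha_{\nu|k} + y_{\nu|k}\,\alpha_{\nu|i}\right), \tag{15}$$

we get

$$\psi = \sum_{i,k=1}^{n} b_{ik}\,dx_i\,dx_k. \tag{14'}$$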
This is what is called the second fundamental form. Its
coefficients $b_{ik}$, given by (15), can also be expressed in another way,
which will be useful farther on. Differentiating (12'), we get

$$\sum_{\nu=1}^{n+1} \left(\alpha_{\nu|k}\,y_{\nu|l} + \alpha_\nu\,y_{\nu|lk}\right) = 0,$$

or, interchanging the indices l and k,

$$\sum_{\nu=1}^{n+1} \left(\alpha_{\nu|l}\,y_{\nu|k} + \alpha_\nu\,y_{\nu|kl}\right) = 0.$$

Taking the half sum of these two identities, and remembering
the symmetry of the second derivatives, we get

$$-\tfrac{1}{2}\sum_{\nu=1}^{n+1} \left(\alpha_{\nu|k}\,y_{\nu|l} + \alpha_{\nu|l}\,y_{\nu|k}\right) = \sum_{\nu=1}^{n+1} \alpha_\nu\,y_{\nu|lk}.$$

Changing l into i, the left-hand side of this equality becomes
the same as the right-hand side of (15), and therefore

$$b_{ik} = \sum_{\nu=1}^{n+1} \alpha_\nu\,y_{\nu|ik}. \tag{15'}$$

4. Forms of class 1 (hypersurfaces in Euclidean space).

We now wish to find a criterion to determine whether a given
differential quadratic form

$$ds^2 = \sum_{i,k=1}^{n} a_{ik}\,dx_i\,dx_k$$

is of class 1, i.e. whether we can find n + 1 functions (10) which
will reduce it to the Cartesian type. We shall follow a method
similar to that used in § 1, taking as unknowns the n + 1 functions
$y_\nu$ and their n(n + 1) derivatives $y_{\nu|i}$, making (n + 1)² unknowns
in all. By definition these must reduce the given $ds^2$ to the
Euclidean form $\sum_1^{n+1} dy_\nu^2$; this is expressed by the ½n(n + 1)
conditions

$$a_{ik} = \sum_{\nu=1}^{n+1} y_{\nu|i}\,y_{\nu|k}. \tag{16}$$

From these by covariant differentiation we get the equations

$$0 = \sum_{\nu=1}^{n+1} \left(y_{\nu|il}\,y_{\nu|k} + y_{\nu|i}\,y_{\nu|kl}\right). \tag{17}$$

We have also the condition that the principal unknowns $y_\nu$
and the auxiliary unknowns $y_{\nu|i}$ are not independent but are
connected by the differential relations

$$\frac{\partial y_\nu}{\partial x_i} = y_{\nu|i}. \tag{18}$$

We have to determine the conditions of integrability of the
system composed of (16), (17), (18).

First, suppose written down the two equations obtained from
(17) by cyclic interchange of i, k, l; from these three equations,
by adding two of them and subtracting the third, we find, as
in § 1,

$$\sum_{\nu=1}^{n+1} y_{\nu|ik}\,y_{\nu|l} = 0 \quad (i, k, l = 1, 2, \dots, n).$$

Keeping i and k fixed, we have n linear homogeneous equations
in the n + 1 unknowns $y_{\nu|ik}$. The matrix of the coefficients
$y_{\nu|l}$, as in the preceding section, has n for its characteristic;
hence the equations have (n + 1) − n = 1 independent solution;
the others differ from it by a multiplier. Now we see from
(12') that we get one solution by taking $y_{\nu|ik} = \alpha_\nu$; hence,
introducing a multiplier $b_{ik}$, we can write the most general solution
in the form

$$y_{\nu|ik} = b_{ik}\,\alpha_\nu. \tag{19}$$

To find the significance of these b’s, multiply (19) by $\alpha_\nu$ and
sum with respect to ν from 1 to n + 1, using (11); we get

$$\sum_{\nu=1}^{n+1} \alpha_\nu\,y_{\nu|ik} = b_{ik};$$

comparing this with (15'), we find that the b’s just introduced
(which have the property $b_{ik} = b_{ki}$) are identical with the
coefficients of the second fundamental form.

We have now to express the fact that the second covariant
derivatives of the quantities $y_{\nu|i}$ satisfy the commutation formula

$$y_{\nu|ihk} - y_{\nu|ikh} = -\sum_{l=1}^{n} \{il, hk\}\,y_{\nu|l}, \tag{20}$$

which takes the place of the ordinary condition of symmetry of
the second derivatives. To calculate the left-hand side we must
start from (19). By covariant differentiation we get

$$y_{\nu|ikh} = \alpha_\nu\,b_{ikh} + b_{ik}\,\alpha_{\nu|h}, \tag{21}$$

and we have to calculate $\alpha_{\nu|h}$. To do this, we note that on
differentiating (11) we have

$$\sum_{\nu=1}^{n+1} \alpha_\nu\,\alpha_{\nu|h} = 0, \tag{22}$$

and also that the coefficients $b_{ih}$ can also be expressed in the
form

$$\sum_{\nu=1}^{n+1} y_{\nu|i}\,\alpha_{\nu|h} = -b_{ih} \quad (i = 1, 2, \dots, n), \tag{23}$$

which is at once verified by covariant differentiation of the
identity

$$\sum_{\nu=1}^{n+1} y_{\nu|i}\,\alpha_\nu = 0$$

combined with the expression (15') for the b’s. If h is fixed, the
formula (23) represents n linear equations in the n + 1 unknowns
$\alpha_{\nu|h}$; combined with (22) they form a system which can
determine these unknowns. The determinant of this system is in
fact

$$\begin{vmatrix}
\alpha_1 & \alpha_2 & \dots & \alpha_{n+1} \\
y_{1|1} & y_{2|1} & \dots & y_{n+1|1} \\
\vdots & \vdots & & \vdots \\
y_{1|n} & y_{2|n} & \dots & y_{n+1|n}
\end{vmatrix};$$

squaring this, and remembering (11), (12'), and (16), we get the
determinant $\|a_{ik}\|$, which is certainly not zero.

It is easy to verify that the solution of the system (22), (23) is

$$\alpha_{\nu|h} = -\sum_{l=1}^{n} b_{hl}\,y_\nu^l, \tag{24}$$

where we have put

$$y_\nu^l = \sum_{j=1}^{n} a^{lj}\,y_{\nu|j}. \tag{25}$$

Hence (21) becomes

$$y_{\nu|ikh} = \alpha_\nu\,b_{ikh} - \sum_{j=1}^{n} b_{ik}\,b_{jh}\,y_\nu^j.$$

The expression for $y_{\nu|ihk}$ is obtained from this by interchanging
the indices h and k. We can therefore write (20) in the form

$$\alpha_\nu\,(b_{ikh} - b_{ihk}) - \sum_{j=1}^{n} (b_{ik}b_{jh} - b_{ih}b_{jk})\,y_\nu^j = \sum_{l=1}^{n} \{il, hk\}\,y_{\nu|l}. \tag{26}$$

In order to express the right-hand side too in terms of $y_\nu^l$ we
apply Cramer’s usual rule to (25), which gives

$$y_{\nu|l} = \sum_{j=1}^{n} a_{lj}\,y_\nu^j,$$

and substitute this result in the sum; summing with respect to
l we have

$$\sum_{l=1}^{n} \{il, hk\}\,y_{\nu|l} = \sum_{j=1}^{n} (ij, hk)\,y_\nu^j,$$

and therefore (26) becomes

$$-\alpha_\nu\,(b_{ikh} - b_{ihk}) + \sum_{j=1}^{n} y_\nu^j\left[(b_{ik}b_{jh} - b_{ih}b_{jk}) + (ij, hk)\right] = 0 \tag{27}$$

(ν = 1, 2, ..., n + 1; i, k, h = 1, 2, ..., n).




These conditions can be expressed in a considerably simpler
form. Multiply the equality just written by $\alpha_\nu$ and sum with
respect to ν from 1 to n + 1; remembering (11) and observing
that from (25) and (12')

$$\sum_{\nu=1}^{n+1} \alpha_\nu\,y_\nu^j = 0,$$

we get $b_{ikh} - b_{ihk} = 0$,
or in other words the coefficients b must satisfy the condition

$$b_{ikh} = b_{ihk}. \tag{28}$$

The condition (26) can then be written in the form

$$\sum_{j=1}^{n} y_\nu^j\,P_{ij,hk} = 0 \quad (\nu = 1, 2, \dots, n + 1), \tag{29}$$

where we have put

$$P_{ij,hk} = (b_{ik}b_{jh} - b_{ih}b_{jk}) + (ij, hk).$$


Keeping i, h, k fixed, the equations (29) constitute
n + 1 linear homogeneous equations in the n unknowns
$P_{ij,hk}$ (j = 1, 2, ..., n). The characteristic of the matrix of
the coefficients $y_\nu^j$ is n; in fact, taking any one of its
determinants of order n, e.g.

$$\begin{vmatrix}
y_1^1 & y_1^2 & \dots & y_1^n \\
y_2^1 & y_2^2 & \dots & y_2^n \\
\vdots & \vdots & & \vdots \\
y_n^1 & y_n^2 & \dots & y_n^n
\end{vmatrix},$$

it will easily be seen with the help of (25) that it is equal to the
product of the two determinants

$$\begin{vmatrix}
y_{1|1} & y_{1|2} & \dots & y_{1|n} \\
y_{2|1} & y_{2|2} & \dots & y_{2|n} \\
\vdots & \vdots & & \vdots \\
y_{n|1} & y_{n|2} & \dots & y_{n|n}
\end{vmatrix}
\qquad\text{and}\qquad
\begin{vmatrix}
a^{11} & a^{12} & \dots & a^{1n} \\
a^{21} & a^{22} & \dots & a^{2n} \\
\vdots & \vdots & & \vdots \\
a^{n1} & a^{n2} & \dots & a^{nn}
\end{vmatrix},$$

the second of which is certainly not zero. It follows that the
characteristic of the matrix $y_\nu^j$ is the same as that of the matrix
$y_{\nu|i}$, which is n. From a well-known theorem on linear equations
it follows that the system (29) has no solutions except

$$P_{ij,hk} = 0 \quad (i, j, h, k = 1, 2, \dots, n),$$

which is the same as

$$(ij, hk) = b_{ih}b_{jk} - b_{ik}b_{jh}. \tag{30}$$

A more rigorous discussion would show that the formulae
(28) and (30) express all the conditions of integrability of the
system. We can therefore conclude that:

The necessary and sufficient conditions that a given differential
quadratic form may be of class 1 are that it shall be possible to
determine a (real) symmetrical double system $b_{ik}$ such that
Riemann’s symbols for the given form can be expressed by
formula (30), and also such that the system $b_{ikl}$ (the covariant
derivative of $b_{ik}$ with respect to the given differential form) is
symmetrical (formula (28)).

At the end of last chapter (§ 7, p. 236) we found directly,
by assigning suitable explicit expressions to the functions
$y_\nu(x_1, x_2, \dots, x_n)$, that every $ds^2$ of constant positive curvature
is of class 1. The necessary and sufficient conditions just
enumerated must of course be satisfied.

To verify this, we need only take the auxiliary quantities $b_{ik}$
in the form $\sqrt{K}\,a_{ik}$, and remember that, as the manifold in
question by hypothesis is of constant curvature, Riemann’s
symbols (ij, hk) take the form $K(a_{ih}a_{jk} - a_{ik}a_{jh})$. The conditions
(30) are therefore automatically satisfied. Further, by Ricci’s
lemma, the covariant derivatives of the quantities $b_{ik}$, i.e. of
$\sqrt{K}\,a_{ik}$, vanish, so that the conditions (28) are also satisfied.
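
By way of illustration (a worked sketch under assumed co-ordinates, not taken from the text), consider an ordinary sphere of radius $R = 1/\sqrt{K}$ in $S_3$, referred to colatitude $x_1 = \vartheta$ and longitude $x_2 = \varphi$. Since $\sum_\nu \alpha_\nu y_{\nu|l} = 0$ by (12'), the Christoffel terms of the covariant derivative drop out on contraction with $\alpha_\nu$, so that (15') can be evaluated with ordinary second derivatives. The sense of the normal is taken inwards, which is an arbitrary choice made here so that the b’s come out positive.

```latex
% Illustrative example: sphere of radius R with inward unit normal \alpha_\nu = -y_\nu / R.
\begin{align*}
(y_1, y_2, y_3) &= R\,(\sin\vartheta\cos\varphi,\ \sin\vartheta\sin\varphi,\ \cos\vartheta),\\
a_{11} &= R^2, \qquad a_{22} = R^2\sin^2\vartheta, \qquad a_{12} = 0,\\
b_{11} &= \sum_\nu \alpha_\nu\,\frac{\partial^2 y_\nu}{\partial\vartheta^2} = R, \qquad
b_{22} = \sum_\nu \alpha_\nu\,\frac{\partial^2 y_\nu}{\partial\varphi^2} = R\sin^2\vartheta, \qquad
b_{12} = 0,\\
b_{ik} &= \frac{1}{R}\,a_{ik} = \sqrt{K}\,a_{ik}, \qquad
b_{11}b_{22} - b_{12}^2 = R^2\sin^2\vartheta = K\,(a_{11}a_{22} - a_{12}^2).
\end{align*}
```

The last equality is formula (30) for the single symbol (12, 12) in this case; reversing the sense of the normal changes the sign of every $b_{ik}$ but leaves the products in (30) unaltered.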

For K < 0, the hypothesis $b_{ik} = \sqrt{K}\,a_{ik}$ is of no use, as
it would take us out of the real field, so that we cannot assert
that the analogous property holds. We can in fact prove that
for n > 2 a $ds^2$ of constant negative curvature is not of class 1.¹
For n = 2 we know already (§ 21, p. 123) that any $ds^2$, and
therefore in particular a $ds^2$ of constant curvature, is of class 1
(at most), or in other words belongs certainly to some surface of
ordinary space. There are an infinite number of surfaces of this
kind (pseudospherical surfaces), with constant negative K,
including surfaces of revolution of three types.²

¹ Cf. Bianchi: Lezioni di geometria differenziale, 2nd edition (Pisa, Spoerri,
1902), Vol. I, Ch. XIV, § 205, p. 471.


5. Hyperspherical representation and curvature of a hypersurface.


Take any hypersurface $V_n$, and consider it as immersed in a
Euclidean space $S_{n+1}$; consider also a hypersphere of unit
radius and centre the origin.³

We can make each point P of the $V_n$ correspond to a point
P' of the hypersphere by drawing from the centre of the latter
the parallel to the normal to the $V_n$ at P, and taking the
intersection of this parallel with the hypersphere as P'; $V_n$ is then
said to be represented on the hypersphere.

The chief interest of this representation is as follows. Let
V denote the extension (Chapter VI, p. 160) of a region φ of $V_n$,
and V' the extension of the corresponding hyperspherical region
φ'. Then the ratio V'/V is closely related to the curvature properties
of $V_n$, and is called the mean curvature of $V_n$ in the region φ. If
this region reduces to the infinitesimal region round a point
P (or in other words if the maximum dimension of φ tends to
zero) then (if P is not a singular point) the ratio V'/V tends to a
positive limit Γ, which is called the hyperspherical (if n = 2,
the spherical) curvature of the $V_n$ at P.

To find an expression for this quantity, we shall first establish
a system of intrinsic co-ordinates on the hypersphere. The most
obvious way of doing this is to assign to each point P' of the
hypersphere the co-ordinates $x_1, x_2, \dots, x_n$ of the corresponding
point P of $V_n$. We shall call the line element of the
hypersphere dσ, and shall try to find an expression for it in
terms of the dx’s.


² Ibid., Ch. VII, § 103, p. 225; or 3rd edition (Bologna, Zanichelli, 1922),
Vol. I, Ch. VII, § 127, p. 338.

³ That is to say, as explained in § 7, Chapter VIII, p. 240, a hypersurface
whose equation in Cartesian co-ordinates is $\sum_1^{n+1} y_\nu^2 = 1$.

If we denote the direction cosines of the normal to $V_n$ at a
generic point P by $\alpha_\nu$ (ν = 1, 2, ..., n + 1), as in preceding
sections, then the direction cosines of the parallel through the
origin (the centre of the hypersphere) to this normal will also
be $\alpha_\nu$. The point P' lies on this line, at unit distance from the
origin; its Cartesian co-ordinates are therefore $\alpha_\nu$.

We then have at once

$$d\sigma^2 = \sum_{\nu=1}^{n+1} d\alpha_\nu^2,$$

and putting

$$e_{hk} = \sum_{\nu=1}^{n+1} \alpha_{\nu|h}\,\alpha_{\nu|k}, \tag{31}$$

it follows that

$$d\sigma^2 = \sum_{h,k=1}^{n} e_{hk}\,dx_h\,dx_k. \tag{32}$$

This is the first fundamental form relative to the hypersphere;
it is sometimes called the third fundamental form of the given
$V_n$. By means of it we can at once calculate the extension V'
of a hyperspherical region φ':

$$V' = \int_{\phi'} \sqrt{e}\;dx_1\,dx_2 \dots dx_n,$$

where e represents the determinant of the quantities $e_{hk}$.
Analogously, for the corresponding field of $V_n$ we have the extension
V of φ:

$$V = \int_{\phi} \sqrt{a}\;dx_1\,dx_2 \dots dx_n.$$

If the regions considered are infinitesimal, each integral
reduces to a single element; taking the ratio of these, we get

$$\Gamma = \sqrt{\frac{e}{a}}, \tag{33}$$

where in every case the radicals are of course supposed to have
their absolute values.

The coefficients $e_{hk}$ can be expressed in terms of the derivatives
of the y’s by means of (31), which on substituting for the $\alpha_{\nu|h}$ the
values given by (24) becomes

$$e_{hk} = \sum_{l,j=1}^{n}\sum_{\nu=1}^{n+1} b_{hl}\,b_{kj}\,y_\nu^l\,y_\nu^j,$$

and by (25) and (16)

$$e_{hk} = \sum_{l,j,i,m}\sum_{\nu} b_{hl}\,b_{kj}\,a^{li}a^{jm}\,y_{\nu|i}\,y_{\nu|m}
= \sum_{l,j,i,m} b_{hl}\,b_{kj}\,a^{li}a^{jm}a_{im}
= \sum_{l,j=1}^{n} a^{lj}\,b_{hl}\,b_{jk}.$$

From this expression of the $e_{hk}$ in terms of the $a^{lj}$ and the
$b_{ik}$ it is easy to obtain an expression for the determinant e in
terms of the determinants a and b. To find it, put

$$\beta_k^l = \sum_{j=1}^{n} a^{lj}\,b_{jk}, \tag{34}$$

so that the last of the formulae just given for $e_{hk}$ may be written
as

$$e_{hk} = \sum_{l=1}^{n} b_{hl}\,\beta_k^l. \tag{34'}$$

Comparing (34) and (34') with the formulae for the general
term in the product of two determinants, we see that from them
follow the two equations

$$\beta = \frac{b}{a}, \tag{35}$$

$$e = b\beta, \tag{35'}$$

where $1/a = \|a^{ik}\|$ and we have put $\beta = \|\beta_k^l\|$, as can easily be
verified. Multiplying together (35) and (35') term by term we
have

$$e = \frac{b^2}{a}. \tag{36}$$

Hence (33) becomes

$$\Gamma = \left|\frac{b}{a}\right|, \tag{33'}$$

a formula expressing the hyperspherical curvature Γ in terms of
the discriminants of the two fundamental forms.

It will be seen that the curvature defined here is not an
intrinsic property, as it depends on the coefficients $b_{ik}$.

Let us apply these remarks to an ordinary surface $V_2$ immersed
in a three-dimensional Euclidean space. In this case,
as we know, there is one distinct Riemann’s symbol (12, 12),
and (30) gives

$$(12, 12) = b_{11}b_{22} - b_{12}^2 = b.$$

Hence (33') can be written in the form

$$\Gamma = \left|\frac{(12, 12)}{a}\right|.$$

Comparing this with formula (28) on p. 19t, we see that
for n = 2, the curvature Γ coincides in absolute value with the
Gaussian curvature K.
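
The formula (33') lends itself to a direct numerical check for n = 2. The sketch below is an illustration only and is not part of the original text: the test surface $y_3 = x_1 x_2$, the finite-difference step, and all function names are assumptions. It builds the coefficients $a_{ik}$ from (16), the direction cosines of the normal from (11) and (12'), the coefficients $b_{ik}$ from (15'), and finally Γ = |b/a|, which for this surface should agree with the known absolute Gaussian curvature $1/(1 + x_1^2 + x_2^2)^2$.

```python
import math

def saddle(x):
    """Hypothetical test surface y_3 = x_1 * x_2, given by its Cartesian co-ordinates in S_3."""
    return [x[0], x[1], x[0] * x[1]]

def partial(f, x, i, h=1e-4):
    """Central-difference first derivatives y_{nu|i} of every component."""
    xp, xm = list(x), list(x)
    xp[i] += h
    xm[i] -= h
    return [(p - m) / (2 * h) for p, m in zip(f(xp), f(xm))]

def second_partial(f, x, i, k, h=1e-4):
    """Central difference of the first derivatives: ordinary second derivatives of y_nu."""
    xp, xm = list(x), list(x)
    xp[k] += h
    xm[k] -= h
    return [(p - m) / (2 * h)
            for p, m in zip(partial(f, xp, i, h), partial(f, xm, i, h))]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def unit_normal(t1, t2):
    """Direction cosines alpha_nu: unit vector orthogonal to both tangents, as in (11), (12')."""
    n = [t1[1] * t2[2] - t1[2] * t2[1],
         t1[2] * t2[0] - t1[0] * t2[2],
         t1[0] * t2[1] - t1[1] * t2[0]]
    norm = math.sqrt(dot(n, n))
    return [c / norm for c in n]

def hyperspherical_curvature(f, x):
    """Return (a, b, Gamma) with Gamma = |det b / det a| as in (33')."""
    t = [partial(f, x, i) for i in range(2)]
    alpha = unit_normal(t[0], t[1])
    a = [[dot(t[i], t[k]) for k in range(2)] for i in range(2)]
    # Contracting with alpha removes the Christoffel part of y_{nu|ik}, by (12').
    b = [[dot(alpha, second_partial(f, x, i, k)) for k in range(2)] for i in range(2)]
    det = lambda m: m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return a, b, abs(det(b) / det(a))

if __name__ == "__main__":
    a, b, gamma = hyperspherical_curvature(saddle, [0.5, -0.3])
    print("Gamma from (33'):", gamma)
    print("closed form     :", 1.0 / (1.0 + 0.5 ** 2 + 0.3 ** 2) ** 2)
```

Only the absolute value is compared, in keeping with the remark that Γ coincides with K in absolute value; the saddle in fact has negative Gaussian curvature.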


CHAPTER X

Some Applications of Intrinsic Geometry

1. General remarks on congruences. Geodesic and normal
congruences.

Consider a metric manifold $V_n$, and suppose that at every
point of $V_n$ (or of a region of $V_n$) there is fixed a direction λ,
defined e.g. by its parameters $\lambda^i$; i.e. that there is given a
contravariant system of regular functions $\lambda^1, \lambda^2, \dots, \lambda^n$ connected
only by the usual quadratic identity and otherwise arbitrary.
On account of this identity one at least of the parameters $\lambda^i$ is
certainly not zero.

If then we consider the following system of n − 1 differential
equations

$$\frac{dx_1}{\lambda^1} = \frac{dx_2}{\lambda^2} = \dots = \frac{dx_n}{\lambda^n} \tag{1}$$

(considering e.g. one of the x’s as the independent variable and
the other n − 1 as unknown functions of the first), we see at
once that the integrals of this system represent lines of $V_n$ which
at every point are in the previously fixed direction λ; in fact, for
an infinitesimal displacement along one of these lines, the dx’s
are proportional to the parameters of λ. Through every point
of the region considered there passes one (and only one) of these
lines; this follows from the fact that the general integral of
(1) contains n − 1 arbitrary constants, which can be so
determined that for an arbitrarily assigned value of the independent
variable the other n − 1 variables have values which are also
arbitrarily assigned. To fix the ideas suppose that in the field
considered $\lambda^n$ is not zero; then (1) can be written

$$\frac{dx_i}{dx_n} = \frac{\lambda^i}{\lambda^n} \quad (i = 1, 2, \dots, n - 1),$$

considering $x_n$ as the independent variable.

It follows from the existence theorem that the integral
equations

$$x_i = x_i(x_n) \quad (i = 1, 2, \dots, n - 1)$$

of the line can be satisfied by an arbitrary set of values of the
n variables, which is equivalent to saying that the line can be
made to pass through a point arbitrarily fixed in advance.¹
Such a system of lines is called a congruence. The quantities
$\lambda^i = dx_i/ds$, where ds denotes the element of arc of the line passing
through the generic point $x_1, x_2, \dots, x_n$, are called the parameters
of the congruence, and the elements $\lambda_i$ of the reciprocal
system are its moments.

If all the lines of a congruence are geodesics, the congruence 
is said to be geodesic; e.g. congruences of straight lines in ordinary 
space. It is easy to determine the analytical condition which 
expresses this property. We know that the characteristic equa- 
tions of a geodesic can be put in the following form (cf. Chapter 
V, formula (5*‘5), p. HP 




d^ 

ds 




where A‘ 


dxi ^ 


^The ar^uinetifc may V)e made clearer by considerinj;; the example of a field 
of force in ordinary phyeics. In this case, when a direction X (that of the force) 
is physically defined at every ])oint of the space considererl, then a system of lines 
(the lines of force) is dettirniined which have at every point the direction of the 
force at that point and which, so to speak, fill all space, as through every point 
there passes one (and only one) line of the system. 
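Now we have

$$\frac{d\lambda^i}{ds} = \sum_{j=1}^{n} \frac{\partial \lambda^i}{\partial x_j}\,\frac{dx_j}{ds};$$

substituting in the previous equation and writing everywhere
$\lambda^j$ instead of $dx_j/ds$, we get

$$\sum_{j=1}^{n} \Big( \frac{\partial \lambda^i}{\partial x_j} + \sum_{l=1}^{n} \{jl, i\}\,\lambda^l \Big)\,\lambda^j = 0,$$

from which, by (6'), p. 147, we get

$$\sum_{j=1}^{n} (\lambda^i)_j\,\lambda^j = 0 \qquad (i = 1, 2, \dots, n). \qquad (2)$$

These are the required conditions. We can express them partly
in terms of the moments by multiplying by $a_{ik}$ and summing with
respect to i, which gives

$$p_k = \sum_{i,j=1}^{n} a_{ik}\,(\lambda^i)_j\,\lambda^j = 0;$$

and as by Ricci's lemma

$$\sum_{i=1}^{n} a_{ik}\,(\lambda^i)_j = \lambda_{kj}$$

(the covariant derivative of the moment $\lambda_k$), we get finally

$$p_k = \sum_{j=1}^{n} \lambda_{kj}\,\lambda^j = 0 \qquad (k = 1, 2, \dots, n). \qquad (2')$$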
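The geodesic condition can be checked numerically in the simplest setting. The following is a minimal sketch, assuming the Euclidean plane referred to Cartesian co-ordinates (so that covariant derivatives reduce to ordinary partial derivatives); the function names, the sample point and the finite-difference step are illustrative assumptions, not part of the text. It evaluates the sum over j of (derivative of the k-th moment with respect to x_j) times the j-th parameter for two congruences: the straight lines through the origin and the circles about the origin.

```python
import math

def radial(x, y):
    """Unit directions of the straight lines through the origin."""
    r = math.hypot(x, y)
    return (x / r, y / r)

def circular(x, y):
    """Unit directions tangent to the circles about the origin."""
    r = math.hypot(x, y)
    return (-y / r, x / r)

def geodesic_defect(field, x, y, step=1e-5):
    """Return [p_1, p_2] with p_k = sum_j (d lambda_k / d x_j) * lambda^j,
    the derivatives being approximated by central differences."""
    lam = field(x, y)
    d_dx = [(a - b) / (2 * step)
            for a, b in zip(field(x + step, y), field(x - step, y))]
    d_dy = [(a - b) / (2 * step)
            for a, b in zip(field(x, y + step), field(x, y - step))]
    return [d_dx[k] * lam[0] + d_dy[k] * lam[1] for k in range(2)]

p_radial = geodesic_defect(radial, 1.0, 2.0)      # vanishes: geodesic congruence
p_circular = geodesic_defect(circular, 1.0, 2.0)  # magnitude 1/r: not geodesic
```

For the circles the defect does not vanish; its magnitude is the curvature 1/r of the circle through the chosen point, as one would expect.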

Another important special property which a congruence may
have is that of being normal, i.e. that of being composed of the
orthogonal trajectories of a family of surfaces. It should be noted
here that, given a family of surfaces, there always exists a congruence
of curves which cut all the surfaces of the family at right
angles and are called orthogonal trajectories; while there does not
always exist a family of surfaces which cut at right angles all the
curves of a congruence. This can be shown as follows.^1

First, let there be given a generic family of surfaces whose
equation is

$$f(x) = \text{constant}.$$

^1 It may be noted incidentally that in Chapter V, p. 127, we have already recognized
the existence of the directions normal to the families of co-ordinate surfaces
$x_i$ = constant, and determined their moments. These results could have been
used here, as any family of surfaces f = constant can always be turned into
co-ordinate surfaces by a change of variables. The line of argument followed in the
text has the advantage of giving directly the explicit expression for the moments
of the normal directions when the equation of the family of surfaces is general.
Consider the surface which passes through a specified point
P whose co-ordinates are $x_1, x_2, \dots, x_n$; it is understood that P
is regular, i.e. that the first derivatives $\partial f/\partial x_i$ are finite and
continuous at P and are not all zero. We wish to show that a direction
perpendicular to the surface, i.e. to every displacement $\delta x_i$
belonging to the surface, is uniquely associated with P.

We first note that for every such displacement $\delta x_i$ we have

$$f(x + \delta x) = f(x)$$

or

$$\sum_{i=1}^{n} \frac{\partial f}{\partial x_i}\,\delta x_i = 0. \qquad (3)$$


If we denote by $\lambda_i$ the moments of the hypothetical perpendicular
direction, then the condition of perpendicularity to every
displacement in the surface is expressed by the relation

$$\sum_{i=1}^{n} \lambda_i\,\delta x_i = 0, \qquad (4)$$

which must hold for all values of the $\delta x_i$'s which satisfy (3). The
coefficients in (3) and (4) of each $\delta x_i$ must therefore be proportional
(cf. § 3, p. 250). In virtue of the quadratic identity

$$\sum_{i,k=1}^{n} a^{ik}\,\lambda_i\,\lambda_k = 1$$

the moments cannot all be zero, so that we can suppose that one
of them, say $\lambda_n$, is not zero, and put $\dfrac{1}{\lambda_n}\dfrac{\partial f}{\partial x_n} = p$. Writing $f_i$
instead of $\partial f/\partial x_i$ for shortness, the explicit relations equivalent to
(4) take the form

$$f_i = p\,\lambda_i \qquad (i = 1, 2, \dots, n). \qquad (5)$$


The $f_i$'s being known, these equations determine the $\lambda_i$'s,
except for a factor, which in turn is determined (except in
sign) by the above-mentioned quadratic identity, which gives

$$\sum_{i,k=1}^{n} a^{ik}\,f_i\,f_k = p^2.$$

The left-hand side cannot vanish, as by hypothesis one at least
of the $f_i$'s is not zero; we are therefore sure
that $p \neq 0$. Thus given the family of surfaces f = constant,
the orthogonal direction at each point is uniquely determined;
the positive sense on this direction can be chosen at will
(corresponding to the double sign of p). The $\lambda_i$'s being known as
functions of position, the reciprocal elements $\lambda^i$ can be obtained
from them, and thence, by (1), we get a congruence of lines
which cut orthogonally the surfaces of the given family.
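The construction just described can be followed step by step in a small numerical case. The sketch below is only an illustration under stated assumptions: polar co-ordinates (r, θ) in the Euclidean plane, the family f = r cos θ = constant, and a hypothetical helper `normal_direction`; none of these choices comes from the text. It computes the factor p from the quadratic identity, then the moments and the parameters of the normal direction.

```python
import math

def normal_direction(f_grad, a_inv):
    """Given the derivatives f_i and the reciprocal coefficients a^{ik},
    return p (positive sign chosen), the moments f_i / p and the
    parameters sum_k a^{ik} * moment_k of the normal direction."""
    n = len(f_grad)
    p = math.sqrt(sum(a_inv[i][k] * f_grad[i] * f_grad[k]
                      for i in range(n) for k in range(n)))
    moments = [fi / p for fi in f_grad]
    parameters = [sum(a_inv[i][k] * moments[k] for k in range(n))
                  for i in range(n)]
    return p, moments, parameters

# Polar co-ordinates: ds^2 = dr^2 + r^2 dtheta^2, so a^{ik} = diag(1, 1/r^2).
r, theta = 2.0, 0.5
a_inv = [[1.0, 0.0], [0.0, 1.0 / r ** 2]]
# Family f = r cos(theta) = constant, i.e. the straight lines x = constant.
f_grad = [math.cos(theta), -r * math.sin(theta)]

p, moments, parameters = normal_direction(f_grad, a_inv)
# The quadratic identity: sum_i moment_i * parameter_i should equal 1.
norm = sum(m * q for m, q in zip(moments, parameters))
```

Choosing the negative root for p would simply reverse the sense of the normal direction.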

Vice versa, given a priori a congruence of lines by means of
their moments $\lambda_i$ (to be considered as given functions of position),
then in order that the lines of the congruence may be considered
as orthogonal trajectories of a family of surfaces f = constant
the necessary condition is that the derivatives of the function
$f(x_1, x_2, \dots, x_n)$ (which is a priori unknown) should satisfy (5), in
which p denotes a factor which is not zero, but is a priori
undetermined. Such an f does not always exist; we have indeed
already seen that the necessary and sufficient conditions for its
existence are (Chapter II, p. 29, formula (23))

$$X_i\Big(\frac{\partial X_j}{\partial x_k} - \frac{\partial X_k}{\partial x_j}\Big)
+ X_j\Big(\frac{\partial X_k}{\partial x_i} - \frac{\partial X_i}{\partial x_k}\Big)
+ X_k\Big(\frac{\partial X_i}{\partial x_j} - \frac{\partial X_j}{\partial x_i}\Big) = 0
\qquad (i, j, k = 1, 2, \dots, n), \qquad (6)$$

where we must now take $X_i = \lambda_i$, $X_j = \lambda_j$, $X_k = \lambda_k$. Only
some of these conditions are distinct, e.g. those in which the index
k has the fixed value n (the conditions (20) of p. 27), the others
being deducible from them.
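In ordinary space (n = 3, Cartesian co-ordinates) the conditions for a normal congruence reduce to the single classical relation X · curl X = 0, which is unaffected by multiplying X by a non-vanishing factor. The sketch below is a hedged illustration of this special case only; the two sample fields and the helper name are assumptions introduced here, and the curls are written out by hand rather than computed.

```python
def normality_defect(X, curl_X):
    """Return X . curl X; it vanishes when the field X is proportional
    to the gradient of some function, i.e. for a normal congruence."""
    return sum(a * b for a, b in zip(X, curl_X))

def radial_field(x, y, z):
    # X = (x, y, z) is the gradient of (x^2 + y^2 + z^2) / 2; its curl is zero.
    return (x, y, z), (0.0, 0.0, 0.0)

def twisted_field(x, y, z):
    # X = (-y, x, 1) has curl (0, 0, 2), so X . curl X = 2 everywhere.
    return (-y, x, 1.0), (0.0, 0.0, 2.0)

defect_radial = normality_defect(*radial_field(1.0, 2.0, 3.0))
defect_twisted = normality_defect(*twisted_field(1.0, 2.0, 3.0))
```

The first field is normal to the spheres about the origin; the second admits no family of orthogonal surfaces, since its defect is different from zero at every point.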

2. Sets of n congruences. Determination of a vector by n 
invariants. 

We shall now consider n congruences of lines in a generic
$V_n$; thus n directions $\lambda_1, \lambda_2, \dots, \lambda_n$ will be fixed at each point.
We shall further suppose that every two of these directions are
orthogonal, and we shall then say that we have fixed in $V_n$
a set of n orthogonal congruences.

The parameters and moments of these congruences will of
course have two indices, the first of which represents the ordinal
number of the congruence. We shall use the term the congruence
(h) to denote the congruence whose parameters are $\lambda_h^1, \lambda_h^2, \dots, \lambda_h^n$,
and whose moments are therefore the reciprocal elements $\lambda_{h|1},
\lambda_{h|2}, \dots, \lambda_{h|n}$ (with respect to the $ds^2$ of the manifold).
In addition to the usual quadratic identities we shall here
have the conditions of orthogonality of the congruences. Both
sets are included in the formula

$$\sum_{i=1}^{n} \lambda_h^i\,\lambda_{k|i} = \delta_h^k \qquad (h, k = 1, 2, \dots, n); \qquad (7)$$

if h = k, this is the usual relation between parameters
and moments, and if $h \neq k$ it expresses the fact that the
directions $\lambda_h$ and $\lambda_k$, i.e. the congruences (h) and (k), are
orthogonal.

The equations (7) also express the essential fact that the
parameters of a set of n orthogonal congruences are the reciprocal
elements (in the algebraic sense) of the $n^2$ moments
of the same set of congruences, and vice versa (cf. Chapter IV,
p. 74; Chapter VII, p. 206). In addition to (7) the equivalent
formulae

$$\sum_{h=1}^{n} \lambda_h^j\,\lambda_{h|i} = \delta_i^j \qquad (i, j = 1, 2, \dots, n) \qquad (7')$$

therefore hold. Multiplying these by $a_{jk}$ and summing with respect
to j we get the important formula

$$a_{ik} = \sum_{h=1}^{n} \lambda_{h|i}\,\lambda_{h|k} \qquad (i, k = 1, 2, \dots, n) \qquad (7'')$$

giving the coefficients of $ds^2$ in terms of the moments of any set
of n orthogonal congruences. Analogously, multiplying (7') by
$a^{ik}$, summing with respect to i, and then putting i instead of j,
we get

$$a^{ik} = \sum_{h=1}^{n} \lambda_h^i\,\lambda_h^k \qquad (i, k = 1, 2, \dots, n). \qquad (7''')$$

A vector R of our $V_n$ is determined, as we know, by its
covariant components $R_i$ or its contravariant components $R^i$.
Hence when a set of n congruences is fixed in $V_n$, the vector can
also be determined by its n projections on the directions belonging
to these congruences at the point where the vector is considered.
By definition (Chapter V, p. 126), the projection of R on the
direction $\lambda_h$ is the invariant

$$c_h = R \times \lambda_h,$$
which can be expressed in either of the two equivalent forms

$$c_h = \sum_{i=1}^{n} R^i\,\lambda_{h|i}, \qquad (8)$$

$$c_h = \sum_{i=1}^{n} R_i\,\lambda_h^i. \qquad (9)$$

Thus the vector R is determined by the n invariants $c_h$. If
we wish to deduce from these, in a given system of reference, the
covariant or contravariant components, we need only solve the
equations (8) or (9), which, together with (7'), give

$$R^i = \sum_{h=1}^{n} c_h\,\lambda_h^i, \qquad (8')$$

$$R_i = \sum_{h=1}^{n} c_h\,\lambda_{h|i}. \qquad (9')$$

If in particular the vector R is the gradient of an invariant
f (i.e. if the components $R_i$ are the derivatives $f_i$ of f with respect
to the variables $x_i$), then the invariant $c_h$ represents the intrinsic
derivative of f in the direction of the congruence (h). In fact,
if $s_h$ denotes the length of the arc of one of the lines of the
congruence (h), measured from an arbitrary origin, then for a
displacement $ds_h$ along this direction the increment of f will be

$$df = \sum_{i=1}^{n} f_i\,dx_i,$$

where the $dx_i$'s are the differentials corresponding to this
displacement.

Dividing this quantity by $ds_h$ we get by definition the derivative
$\dfrac{\partial f}{\partial s_h}$ of f in the direction of the congruence (h); remembering
that $\dfrac{dx_i}{ds_h} = \lambda_h^i$, we therefore have

$$\frac{\partial f}{\partial s_h} = \sum_{i=1}^{n} f_i\,\lambda_h^i, \qquad (10)$$

a formula corresponding to (9). Solving it, we get the formula

$$f_i = \sum_{h=1}^{n} \frac{\partial f}{\partial s_h}\,\lambda_{h|i} \qquad (10')$$
corresponding to (9'); and lastly, changing to the reciprocal
elements, we get also

$$f^i = \sum_{h=1}^{n} \frac{\partial f}{\partial s_h}\,\lambda_h^i, \qquad (10'')$$

which corresponds to (8').

In general, it would be easy to show that when a set of n
congruences is fixed a tensor of rank m can be determined by
$n^m$ invariants instead of by that number of components, covariant,
contravariant, or mixed, the proof being completely analogous
to that given above for determining a vector by means of n
invariants. This result simplifies the study of certain questions,
so that we shall find it useful to carry somewhat further our
investigations on sets of n congruences.


3. Geometrical definition of Ricci’s coefficients of rotation. 


We must now introduce a system of differential invariants
which are closely connected with the set of n congruences. We
shall reach the required result quickly by the following
method.

Consider two very near points P and P' of $V_n$. At each of
them the lines of the n congruences determine a pyramid (a
generalization of the notion of the trihedron) whose directions are
mutually orthogonal. If $\lambda_1, \lambda_2, \dots, \lambda_n$ are the n directions at P,
those at P' will be $\lambda'_1 = \lambda_1 + \delta'\lambda_1, \dots, \lambda'_n = \lambda_n + \delta'\lambda_n$, and
we shall say that we pass from the first to the second by local
displacement, i.e. by the law previously fixed which regulates the
behaviour of the n lines of the set of n congruences. But
the pyramid of directions can also be moved from P to P' by
parallel displacement; we shall then get at P' n mutually orthogonal
directions $\lambda^*_1 = \lambda_1 + \delta^*\lambda_1, \dots, \lambda^*_n = \lambda_n + \delta^*\lambda_n$, which
will not in general coincide with those obtained by local
displacement. We shall thus have at P' two pyramids infinitely
near one another, since each is infinitely near the pyramid
$\lambda_1, \lambda_2, \dots, \lambda_n$. This means in particular that the ith direction of
one makes an infinitesimal angle with the ith direction of the
other, and an angle very nearly equal to $\dfrac{\pi}{2}$ with the remaining
n − 1 directions of the other. We propose to examine these
infinitesimal differences.
Consider two directions $\lambda_h$, $\lambda_k$ of the pyramid at P; these
either coincide (h = k) or are orthogonal, so that we have

$$\cos\widehat{\lambda_h \lambda_k} = \delta_h^k.$$

Let them be displaced to P', the first by local and the second
by parallel displacement, so that the first will coincide with
$\lambda'_h$ and the second with $\lambda^*_k$. We shall calculate the resulting change
in the cosine of the angle between them, i.e. the quantity

$$\delta \cos\widehat{\lambda_h \lambda_k} = \cos\widehat{\lambda'_h \lambda^*_k} - \cos\widehat{\lambda_h \lambda_k}.$$

This is an infinitesimal of the same order as the distance ds
between P and P', and we shall therefore write it in the form
$p_{hk}\,ds$; thus $p_{hk}$ will give us a kind of measure of the rate at which
the cosine in question changes for a displacement in the direction
PP'. To calculate it, we start from the formula

$$\cos\widehat{\lambda_h \lambda_k} = \sum_{i=1}^{n} \lambda_{h|i}\,\lambda_k^i$$

and differentiate it, remembering that we have to operate on
$\lambda_{h|i}$ with the symbol $\delta'$ (local displacement) and on $\lambda_k^i$ with the
symbol $\delta^*$ (parallel displacement). We shall get

$$\delta \cos\widehat{\lambda_h \lambda_k} = \sum_{i=1}^{n} \big( \delta'\lambda_{h|i} \cdot \lambda_k^i + \lambda_{h|i}\,\delta^*\lambda_k^i \big). \qquad (11)$$

We have also from the ordinary rule of the differential calculus

$$\delta'\lambda_{h|i} = \sum_{j=1}^{n} \frac{\partial \lambda_{h|i}}{\partial x_j}\,\delta x_j,$$

where $\delta x_j$ denotes the increment of the co-ordinate $x_j$ in passing
from P to P', and from the law of parallelism

$$\delta^*\lambda_k^i = -\sum_{j,l=1}^{n} \{jl, i\}\,\lambda_k^l\,\delta x_j.$$

Substituting in (11) we get

$$p_{hk}\,ds = \sum_{i,j=1}^{n} \frac{\partial \lambda_{h|i}}{\partial x_j}\,\lambda_k^i\,\delta x_j
- \sum_{i,j,l=1}^{n} \{jl, i\}\,\lambda_{h|i}\,\lambda_k^l\,\delta x_j.$$
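In the second sum interchange the indices i and l, so as to
get the same factor $\lambda_k^i\,\delta x_j$ as in the first. We can then write

$$p_{hk}\,ds = \sum_{i,j=1}^{n} \Big( \frac{\partial \lambda_{h|i}}{\partial x_j} - \sum_{l=1}^{n} \{ji, l\}\,\lambda_{h|l} \Big)\,\lambda_k^i\,\delta x_j,$$

or, remembering formula (5) of p. 147,

$$p_{hk}\,ds = \sum_{i,j=1}^{n} \lambda_{h|ij}\,\lambda_k^i\,\delta x_j. \qquad (11')$$

Denoting the parameters of the direction PP' by $\delta x_j / ds$
(j = 1, 2, ..., n), we have the formula

$$p_{hk} = \sum_{i,j=1}^{n} \lambda_{h|ij}\,\lambda_k^i\,\frac{\delta x_j}{ds}, \qquad (11'')$$

which holds for any direction whatever.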

It is to be noted that $p_{hk}$, as given by the original definition
(11), changes sign when the two indices are interchanged. This
can be proved without difficulty, either from the final expression
(11''), by going back to (7) and taking its covariant derivative;
or more geometrically, by using the property that any $\cos\widehat{\lambda_h \lambda_k}$
is unchanged by either local or parallel displacement, so that the
formulae

$$\delta'\Big( \sum_{i=1}^{n} \lambda_{h|i}\,\lambda_k^i \Big) = 0, \qquad
\delta^*\Big( \sum_{i=1}^{n} \lambda_{h|i}\,\lambda_k^i \Big) = 0$$

both hold.

Carrying out both differentiations and using the results to
transform (11), we get

$$p_{hk}\,ds = -\sum_{i=1}^{n} \big( \lambda_{h|i}\,\delta'\lambda_k^i + \lambda_k^i\,\delta^*\lambda_{h|i} \big).$$

Further since $\cos\widehat{\lambda_h \lambda_k}$ can also be expressed in the form
$\sum_{i=1}^{n} \lambda_h^i\,\lambda_{k|i}$, (11) is equivalent to

$$p_{hk}\,ds = \sum_{i=1}^{n} \big( \delta'\lambda_h^i \cdot \lambda_{k|i} + \lambda_h^i\,\delta^*\lambda_{k|i} \big).$$

Interchanging h and k, and adding to the previous equation,
we get the required identity

$$p_{hk} + p_{kh} = 0 \qquad (h, k = 1, 2, \dots, n). \qquad (12)$$

We shall now examine the case when the direction of displacement
coincides with one of the directions belonging to the
set of congruences, say the lth. We shall then have

$$\frac{\delta x_j}{ds} = \lambda_l^j,$$

and, denoting by $\gamma_{hkl}$ the value of $p_{hk}$ in this particular case
(i.e. the rate of variation of $\cos\widehat{\lambda_h \lambda_k}$ for a displacement in the
direction of $\lambda_l$, in which $\lambda_h$ is moved by local, and $\lambda_k$ by parallel,
displacement), we shall have from (11'')

$$\gamma_{hkl} = \sum_{i,j=1}^{n} \lambda_{h|ij}\,\lambda_k^i\,\lambda_l^j \qquad (h, k, l = 1, 2, \dots, n). \qquad (13)$$

The quantities $\gamma$ were introduced by Ricci, and named by
him the coefficients of rotation of the set of congruences. They
have various important properties.

In the first place, they are invariant, as follows from (13) by
the law of contraction. We have further, as a particular case of
(12),

$$\gamma_{hkl} + \gamma_{khl} = 0 \qquad (h, k, l = 1, 2, \dots, n), \qquad (14)$$

which for h = k reduces to

$$\gamma_{hhl} = 0. \qquad (15)$$

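We can also give a direct formal proof of (14), on the lines
already suggested for the more general case of the p's. Starting
from the identity (7), and taking the covariant derivative, we
shall get (remembering Chapter VI, p. 152)

$$\sum_{i=1}^{n} \lambda_{h|ij}\,\lambda_k^i + \sum_{i=1}^{n} \lambda_h^i\,\lambda_{k|ij} = 0.$$

Multiplying by $\lambda_l^j$, and summing with respect to j from 1
to n, we get

$$\sum_{i,j=1}^{n} \lambda_{h|ij}\,\lambda_k^i\,\lambda_l^j + \sum_{i,j=1}^{n} \lambda_{k|ij}\,\lambda_h^i\,\lambda_l^j = 0,$$

or

$$\gamma_{hkl} + \gamma_{khl} = 0.$$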
The number of these invariants $\gamma$, which depend on three
indices, is a priori $n^3$; but they are connected by the $\tfrac{1}{2}n^2(n+1)$
relations (14) of antisymmetry. Hence the number which are
algebraically distinct is at most

$$n^3 - \frac{n^2(n+1)}{2} = \frac{n^2(n-1)}{2}.$$

The minuend $n^3$ is equal to the number of the derivatives
$\lambda_{h|ij}$ of the quantities $\lambda_{h|i}$, and the subtrahend $\tfrac{1}{2}n^2(n+1)$ to
the number of the relations given above as resulting from the
differentiation of the equations (7) and connecting the $n^3$ derivatives.
We can accordingly express the derivatives $\lambda_{h|ij}$ as functions
of the quantities $\lambda$ and $\gamma$, by solving the equations (13).
To do this, multiply (13) by $\lambda_{k|i'}\,\lambda_{l|j'}$, and sum with respect to
k and l. We get

$$\sum_{k,l=1}^{n} \gamma_{hkl}\,\lambda_{k|i'}\,\lambda_{l|j'}
= \sum_{i,j=1}^{n} \lambda_{h|ij} \sum_{k=1}^{n} \lambda_k^i\,\lambda_{k|i'} \sum_{l=1}^{n} \lambda_l^j\,\lambda_{l|j'}
= \lambda_{h|i'j'},$$

or finally, replacing i' and j' by i and j,

$$\lambda_{h|ij} = \sum_{k,l=1}^{n} \gamma_{hkl}\,\lambda_{k|i}\,\lambda_{l|j}. \qquad (16)$$

This result shows that in order to study the differential properties
(i.e. the properties depending on the way in which the
$\lambda$'s vary) of the lines of the given congruences we need only
consider the invariants $\gamma$, in terms of which all the derivatives
of the $\lambda$'s can be expressed.
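The definition of the coefficients of rotation lends itself to a direct numerical check. The sketch below is offered only as an illustration, under the assumption of the Euclidean plane in Cartesian co-ordinates (where covariant derivatives are ordinary partial derivatives and parameters coincide with moments) and of the pair of congruences formed by the radial lines and the concentric circles; the function names and the finite-difference step are hypothetical choices. It forms the triple sums of derivative of the h-th set of moments, k-th parameters and l-th parameters.

```python
import math

def frame(x, y):
    """Radial and transverse unit directions at (x, y)."""
    r = math.hypot(x, y)
    return [(x / r, y / r), (-y / r, x / r)]

def rotation_coefficients(x, y, step=1e-5):
    """gamma[(h, k, l)] = sum_{i,j} (d lambda_{h|i} / d x_j) lambda_k^i lambda_l^j,
    with the derivatives approximated by central differences."""
    lam = frame(x, y)
    plus = [frame(x + step, y), frame(x, y + step)]    # shifted along x_1, x_2
    minus = [frame(x - step, y), frame(x, y - step)]
    gamma = {}
    for h in range(2):
        for k in range(2):
            for l in range(2):
                total = 0.0
                for i in range(2):
                    for j in range(2):
                        d = (plus[j][h][i] - minus[j][h][i]) / (2 * step)
                        total += d * lam[k][i] * lam[l][j]
                gamma[(h + 1, k + 1, l + 1)] = total
    return gamma

gamma = rotation_coefficients(3.0, 4.0)   # a point at distance r = 5 from the origin
```

At the chosen point the only coefficients that do not vanish are those with indices (1, 2, 2) and (2, 1, 2), equal to 1/r and −1/r respectively, in agreement with the antisymmetry in the first two indices.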

The geometrical significance of the $\gamma$'s, which we have already
illustrated, is particularly expressive in the case of ordinary space.
In this case the three congruences define at every point a triplet
of orthogonal directions, and $p_{23}$, $p_{31}$, $p_{12}$ are the components of
a vector $\omega$ such that $\omega\,ds$ is the elementary rotation of the triplet
in the local displacement from P to P'.^1

^1 See e.g. Levi-Civita and Amaldi: Lezioni di Meccanica Razionale, Vol. I,
p. 178; Bologna, Zanichelli, 1923. For the general case, the reader may be referred
further to a paper by Signorina Carpanese: "Parallelismo e curvatura in una
varietà qualunque", in Annali di Mat., Vol. XXVIII, 1919, pp. 147-189.
4. Commutation formula for the second derivatives along the
arcs.

The invariants $\gamma$ occur in another important formula, which
we shall now establish.

We wish to compare the two second derivatives

$$\frac{\partial}{\partial s_k}\frac{\partial f}{\partial s_h}, \qquad \frac{\partial}{\partial s_h}\frac{\partial f}{\partial s_k};$$

we shall find that they are not equal, but are connected by
a more complicated relation involving also the first derivatives
and the $\gamma$'s.

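We have in the first place from (10), differentiating the
invariant $\dfrac{\partial f}{\partial s_h}$ with respect to $x_j$ and applying to the right-hand
side the rule for differentiation given in Chapter VI, p. 152,

$$\frac{\partial}{\partial x_j}\frac{\partial f}{\partial s_h} = \sum_{i=1}^{n} f^i\,\lambda_{h|ij} + \sum_{i=1}^{n} f_{ij}\,\lambda_h^i,$$

where $f_{ij}$ denotes the second covariant derivative of f.

We next replace $f^i$ in the first term on the right by the expression
given for it by (10'') (putting l instead of h for the index of
summation), multiply both sides by $\lambda_k^j$, and sum with respect
to j. We thus get

$$\sum_{j=1}^{n} \lambda_k^j\,\frac{\partial}{\partial x_j}\frac{\partial f}{\partial s_h}
= \sum_{l=1}^{n} \frac{\partial f}{\partial s_l} \sum_{i,j=1}^{n} \lambda_{h|ij}\,\lambda_l^i\,\lambda_k^j
+ \sum_{i,j=1}^{n} f_{ij}\,\lambda_h^i\,\lambda_k^j.$$

By the definition (10) of the intrinsic derivative, the left-hand
side of this equation is precisely $\dfrac{\partial}{\partial s_k}\dfrac{\partial f}{\partial s_h}$; the first term on the
right, from the definition (13) of the invariants $\gamma$, reduces to

$$\sum_{l=1}^{n} \gamma_{hlk}\,\frac{\partial f}{\partial s_l} = -\sum_{l=1}^{n} \gamma_{lhk}\,\frac{\partial f}{\partial s_l}.$$

We therefore have

$$\frac{\partial}{\partial s_k}\frac{\partial f}{\partial s_h}
= \sum_{l=1}^{n} \gamma_{hlk}\,\frac{\partial f}{\partial s_l} + \sum_{i,j=1}^{n} f_{ij}\,\lambda_h^i\,\lambda_k^j.$$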
with respect to i and k from 1 to n; remembering (13), we get

$$\lambda_{n|j}\,\big( \gamma_{nk'i'} - \gamma_{ni'k'} \big) = 0. \qquad (19')$$

As j may have any value, we can always choose it so that
$\lambda_{n|j} \neq 0$; thus we have

$$\gamma_{nki} = \gamma_{nik} \qquad (i, k = 1, 2, \dots, n-1), \qquad (20)$$

where we have written i, k instead of i', k'. Reciprocally, if the
equations (20) are satisfied, the equations (19') follow from them,
and therefore also (19) as a necessary consequence. The equations
(20) therefore constitute the required condition.

It is not without interest to find this condition by another
method, starting from the remark that if the quantities $\lambda_{n|i}$ are
to be proportional to the derivatives $\dfrac{\partial f}{\partial x_i}$ of a single function f,
these derivatives can be substituted for them in the conditions
of orthogonality

$$\sum_{i=1}^{n} \lambda_h^i\,\lambda_{n|i} = 0 \qquad (h = 1, 2, \dots, n-1),$$

so that the hypothetical function f must satisfy the linear system
of partial differential equations

$$\sum_{i=1}^{n} \lambda_h^i\,\frac{\partial f}{\partial x_i} = 0 \qquad (h = 1, 2, \dots, n-1).$$

Reciprocally, if there exists a function f which satisfies these
n − 1 equations, its derivatives must be proportional to the
quantities $\lambda_{n|i}$.

Hence the conditions in question are the necessary and sufficient
conditions that the given n − 1 equations may constitute
a complete system (cf. Chapter III, § 9, p. 52).

To make the notation agree with that in Chapter III, we introduce
the linear operators

$$X_h = \sum_{i=1}^{n} \lambda_h^i\,\frac{\partial}{\partial x_i} \qquad (h = 1, 2, \dots, n-1),$$

noting that (10) shows that these operators are identical with the
derivatives $\dfrac{\partial}{\partial s_h}$ with respect to the arcs. We thus have the system

$$X_h f = 0 \qquad (h = 1, 2, \dots, n-1),$$
and we have to express the condition that, for h, k = 1, 2, ...,
n − 1, Poisson's parentheses

$$(X_h, X_k) f = X_h X_k f - X_k X_h f$$

are linear combinations of the terms $X_l f$.

Now, repeating the steps of the calculation in § 4, or better,
borrowing from it the value already found for $\dfrac{\partial}{\partial s_k}\dfrac{\partial f}{\partial s_h}$, we have

$$\frac{\partial}{\partial s_k}\frac{\partial f}{\partial s_h}
= \sum_{l=1}^{n} \gamma_{hlk}\,\frac{\partial f}{\partial s_l} + \sum_{i,j=1}^{n} f_{ij}\,\lambda_h^i\,\lambda_k^j.$$

Interchanging h and k, and subtracting, the second sum disappears.
In the first, we must separate out the term corresponding
to the value n of the index l, and put $X_l f$ again instead of
$\dfrac{\partial f}{\partial s_l}$. We thus get

$$(X_h, X_k) f = \sum_{l=1}^{n-1} \big( \gamma_{klh} - \gamma_{hlk} \big)\,X_l f
+ \big( \gamma_{knh} - \gamma_{hnk} \big)\,\frac{\partial f}{\partial s_n}.$$

This must reduce to a linear combination of the quantities

$$X_l f \qquad (l = 1, 2, \dots, n-1).$$

As $\dfrac{\partial f}{\partial s_n}$ is independent of the $X_l f$'s, its coefficient in each of
the parentheses included in the above expression (i.e. for
h, k = 1, 2, ..., n − 1) must vanish; this brings us back to (20).

It may be noted that if all the n congruences of the set are
normal, the $\gamma$'s with three distinct indices are all zero. In fact,
choosing three distinct indices i, h, k, we have the following
identities:

$$\gamma_{ihk} = \gamma_{ikh}, \qquad \gamma_{hki} = \gamma_{hik}, \qquad \gamma_{kih} = \gamma_{khi};$$

adding the first two and subtracting the third, and remembering
that the $\gamma$'s are antisymmetrical in the first two indices, we get

$$\gamma_{ihk} = -\gamma_{ihk},$$

or

$$\gamma_{ihk} = 0 \qquad (i, h, k = 1, 2, \dots, n)$$

for every triplet of three distinct indices.
If we put

$$\gamma_{ij,hk} = \frac{\partial \gamma_{ijh}}{\partial s_k} - \frac{\partial \gamma_{ijk}}{\partial s_h}
+ \sum_{l=1}^{n} \big\{ \gamma_{ijl}\,(\gamma_{lhk} - \gamma_{lkh}) + \gamma_{lik}\,\gamma_{ljh} - \gamma_{lih}\,\gamma_{ljk} \big\},$$

we get at once, by definition and the antisymmetry of the
coefficients of rotation $\gamma$ with respect to the first two indices,

$$\gamma_{ij,hk} = -\gamma_{ij,kh}, \qquad \gamma_{ij,hk} = -\gamma_{ji,hk}.$$

I add, but without giving a proof, that the cyclic identities

$$\gamma_{ij,hk} + \gamma_{ih,kj} + \gamma_{ik,jh} = 0$$

also hold; and from these it follows ultimately that

$$\gamma_{ij,hk} = \gamma_{hk,ij} \qquad (i, j, h, k = 1, 2, \dots, n).$$

Ricci discovered all these results as far back as 1895, basing
his researches with regard to the four-index $\gamma$'s on the analogous
properties of Riemann's symbols of the first kind (cf. Chapter
VII, p. 179). A particularly simple and direct proof has recently
been given by Dei.^1

8. Canonical system with respect to a given congruence. 

In many questions a congruence of lines is either among the
data of the problem, or is closely connected with them. In order
to deal with these problems it is often useful to associate with
the given congruence n − 1 others, forming with the given one
a set of n mutually orthogonal congruences, so that the given
congruence can be considered as the nth of this set. The choice
of the n − 1 auxiliary congruences is a priori arbitrary; in many
cases this arbitrariness may be taken advantage of to introduce
some simplification. This is possible, as we shall now see; and
the conclusion we shall reach is that given any congruence whatever,
there is always at least one way of choosing the other n − 1
so that the relations

$$\gamma_{nkl} + \gamma_{nlk} = 0 \qquad (k \neq l;\ k, l = 1, 2, \dots, n-1) \qquad (21)$$

may be satisfied.

The system (or any one of the systems) of n — 1 congruences 

^1 "Sulle relazioni differenziali che legano i coefficienti di rotazione del Ricci",
in Rend. della R. Acc. dei Lincei, Vol. XXXII (first half-year, 1923), pp. 474-479.
which possesses this characteristic is called a canonical system
with respect to the given congruence.

To prove that such a system exists, we associate with the
given congruence a system (for the moment any whatever)
of n − 1 other orthogonal congruences, and fix our attention on
a generic point P of the manifold; for shortness we shall denote
by $\varpi$ the pyramid of the n − 1 directions $\lambda_1, \lambda_2, \dots, \lambda_{n-1}$ drawn
from P, orthogonal to $\lambda_n$ and to one another. Suppose this
pyramid rotated round the direction $\lambda_n$, by which we mean that
we pass from the pyramid $\varpi$ to another $\varpi'$ formed by n − 1
other directions $\lambda'_1, \lambda'_2, \dots, \lambda'_{n-1}$, also drawn from P, and orthogonal
to $\lambda_n$ and to one another. We wish, if possible, to
determine the rotation so that after it has been effected the
relations (21) may hold. For this we shall start from the relations
connecting the $\lambda'_k$'s with the $\lambda_h$'s, which express analytically
the rotation described.

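Let $\alpha_{hk}$ (h, k = 1, 2, ..., n) be the cosine of the angle between
the directions $\lambda_h$ and $\lambda'_k$. Naturally, if only one of the two
indices h, k coincides with n, the corresponding $\alpha$ is zero
(one of the two directions then being $\lambda_n$, and the corresponding
angle $\dfrac{\pi}{2}$), while $\alpha_{nn} = 1$. The formulae for this are

$$\alpha_{hn} = \alpha_{nk} = 0 \quad (h, k = 1, 2, \dots, n-1); \qquad \alpha_{nn} = 1.$$

We have in any case by definition

$$\sum_{j=1}^{n} \lambda_h^j\,\lambda'_{k|j} = \alpha_{hk} \qquad (h, k = 1, 2, \dots, n),$$

and thence, multiplying by $\lambda_{h|i}$ and summing with respect to h,

$$\lambda'_{k|i} = \sum_{h=1}^{n} \alpha_{hk}\,\lambda_{h|i}.$$

Limiting k to the values 1, 2, ..., n − 1, for all of which
$\alpha_{nk} = 0$, we can take the sum on the right only to n − 1, so
that we have

$$\lambda'_{k|i} = \sum_{h=1}^{n-1} \alpha_{hk}\,\lambda_{h|i} \qquad (k = 1, 2, \dots, n-1); \qquad (22)$$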
i.e. the moments of the rotated pyramid $\varpi'$ are connected with those
of the original pyramid $\varpi$ by a linear substitution, as could have been
anticipated, and the $\alpha_{hk}$ are the coefficients of this substitution.
It is also to be anticipated that the substitution is orthogonal.
To prove this, we take the equations (7''); putting k = i, they give

$$\sum_{h=1}^{n} (\lambda_{h|i})^2 = a_{ii} \qquad (i = 1, 2, \dots, n).$$

The coefficients $a_{ii}$ on the right depend on the co-ordinates of
reference, but not on the choice of the congruences associated
with (n).

Since == 0 for A 4= it follows that for any value of 
i (4= w) the expression 

is invariant for rotations of the pyramid $\varpi$, and therefore the
substitution defined by the $\alpha$'s is orthogonal. We have now to
arrange this orthogonal substitution of order n − 1 in such a
way that the relations (21) may be satisfied.

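To do this we start from (16), from which we get as a particular
case

$$\lambda_{n|ij} = \sum_{k,l=1}^{n} \gamma_{nkl}\,\lambda_{k|i}\,\lambda_{l|j} \qquad (i, j = 1, 2, \dots, n-1).$$

The terms of this sum in which k = n vanish, by (15); those
in which l = n can be separated out by writing

$$\lambda_{n|ij} = \sum_{k,l=1}^{n-1} \gamma_{nkl}\,\lambda_{k|i}\,\lambda_{l|j} + \lambda_{n|j} \sum_{k=1}^{n} \gamma_{nkn}\,\lambda_{k|i}.$$

The last sum can be suitably transformed by replacing $\gamma_{nkn}$
by the expression given by (13); we then get successively

$$\sum_{k=1}^{n} \gamma_{nkn}\,\lambda_{k|i}
= \sum_{k=1}^{n} \lambda_{k|i} \sum_{p,q=1}^{n} \lambda_{n|pq}\,\lambda_k^p\,\lambda_n^q
= \sum_{q=1}^{n} \lambda_{n|iq}\,\lambda_n^q.$$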
We can therefore write

$$\lambda_{n|ij} = \sum_{k,l=1}^{n-1} \gamma_{nkl}\,\lambda_{k|i}\,\lambda_{l|j} + \lambda_{n|j} \sum_{q=1}^{n} \lambda_{n|iq}\,\lambda_n^q.$$

Now in this formula it is to be remarked that the left-hand
side and the last term on the right depend on the parameters
and moments of the direction $\lambda_n$ alone, and do not depend on the
other n − 1 associated directions; the same must therefore be
true of the remaining part, i.e. of the sum

$$\sum_{k,l=1}^{n-1} \gamma_{nkl}\,\lambda_{k|i}\,\lambda_{l|j} \qquad (i, j = 1, 2, \dots, n-1). \qquad (23)$$

We can therefore conclude that these expressions are invariant
for any rotation whatever of the pyramid $\varpi$.

Of the $(n-1)^2$ quadratic forms included in formula (23),
which are obtained by choosing the indices i, j in every possible
way, we are interested in any one in which i = j. Fixing the
index i once for all, and putting for shortness

$$\lambda_{r|i} = z_r \qquad (r = 1, 2, \dots, n-1),$$

the corresponding quadratic form is

$$\sum_{k,l=1}^{n-1} \gamma_{nkl}\,z_k\,z_l = \frac{1}{2} \sum_{k,l=1}^{n-1} \big( \gamma_{nkl} + \gamma_{nlk} \big)\,z_k\,z_l. \qquad (23')$$

In this the coefficient of the product $z_k z_l$ is $\gamma_{nkl} + \gamma_{nlk}$, i.e.
the left-hand side of (21). If we wish to satisfy (21), we must
make all the coefficients of the terms in $z_k z_l$, for which $k \neq l$,
vanish by means of the orthogonal substitution (22), which we
shall write in the form

$$z'_k = \sum_{h=1}^{n-1} \alpha_{hk}\,z_h; \qquad (22')$$

this is equivalent to reducing the invariant quadratic form (23')
to the canonical form

$$\sum_{k=1}^{n-1} \rho_k\,{z'_k}^2 \qquad (23'')$$

by an orthogonal substitution. This algebraic problem is always
soluble. In the cases n − 1 = 2 or 3, it corresponds to the problem
of finding the axes of a conic or a quadric, and is discussed
in ordinary analytical geometry. In the general case the theory
leads to the following result.

Consider the equation

$$\Big\| \tfrac{1}{2}\big( \gamma_{nkl} + \gamma_{nlk} \big) - \delta_k^l\,\rho \Big\| = 0 \qquad (k, l = 1, 2, \dots, n-1), \qquad (24)$$

which is of degree n − 1 in the unknown $\rho$, and is called a
secular equation. Its n − 1 roots are always real (it is understood
that we suppose the quantities $\gamma$ real), and give the n − 1
coefficients of the canonical form (23'').^1

We can therefore always choose, at any point P, the pyramid
$\varpi$ and therefore the system of the n − 1 congruences (1), (2), ...,
(n − 1) so as to satisfy (21); i.e. there always exists at least one
canonical system with respect to a given congruence. If the
n − 1 roots of (24) are all different, the canonical system is
uniquely determined; if they are all equal, any system of n − 1
congruences which are orthogonal to one another and to (n)
satisfies (21) and may therefore be called canonical. In the
general case where the number of different roots is p (1 < p <
n − 1), then n − 1 − p coefficients of the orthogonal substitution
are arbitrary, and there are therefore $\infty^{\,n-1-p}$ canonical systems.
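For n − 1 = 2 the reduction to canonical form is the familiar problem of finding the axes of a conic, and can be carried out explicitly. The following sketch makes that case concrete; it assumes a hypothetical 2 × 2 array `g` whose entry `g[k][l]` stands for the coefficient of rotation with first index n and remaining indices k + 1, l + 1, and the sample values are invented purely for illustration. It finds the rotation angle which annuls the mixed term of the symmetric part and returns the two roots of the secular equation.

```python
import math

def canonical_rotation(g):
    """Rotate the symmetric part of the 2 x 2 array g to diagonal form.
    Returns the angle, the two diagonal coefficients (the roots of the
    secular equation) and the residual off-diagonal coefficient."""
    s11, s22 = g[0][0], g[1][1]
    s12 = 0.5 * (g[0][1] + g[1][0])
    theta = 0.5 * math.atan2(2.0 * s12, s11 - s22)
    c, s = math.cos(theta), math.sin(theta)
    # Coefficients of the form referred to the rotated directions
    # (c, s) and (-s, c).
    r11 = c * c * s11 + 2.0 * c * s * s12 + s * s * s22
    r22 = s * s * s11 - 2.0 * c * s * s12 + c * c * s22
    r12 = (c * c - s * s) * s12 - c * s * (s11 - s22)
    return theta, (r11, r22), r12

g = [[2.0, 3.0], [-1.0, 0.0]]          # illustrative values only
theta, roots, mixed = canonical_rotation(g)
```

With these sample values the symmetric part has rows (2, 1) and (1, 0), and the two roots come out as 1 ± √2; the vanishing of the mixed coefficient is precisely the relation required of a canonical system. The antisymmetric part of `g` plays no part in the reduction.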

9. Congruences of straight lines in Euclidean space. Geometrical
significance of the canonical system.

In ordinary (i.e. Euclidean three-dimensional) space parti- 
cular importance attaches to congruences of straight lines, which 
present themselves for consideration in various questions of geo- 
metrical optics; since the rays of a light pencil (in a homogeneous 
medium) form a rectilinear congruence. 

We shall now discuss a geometrical property of these congruences,
which will be seen to be connected with the discussion
in the preceding section; or rather, since it involves no greater
complication, we shall discuss congruences of lines in a Euclidean
space of any number n of dimensions.

Consider a generic point P, and let r be the ray through P 
of the given rectilinear congruence; let X be the hyperplane 
(in ordinary space the plane) perpendicular to r at P. Take a 
displacement in X represented by the infinitesimal segment 

^1 Compare Chapter VII, p. 205, where references are given.
PP' = $\epsilon$ in any direction; through P' will pass another ray
r' of the congruence. In general, the two rays r, r' are skew;
if for a particular direction of the displacement PP' it happens
that they both lie in the same plane, i.e. that they meet or are
parallel (more precisely, that the minimum distance between
them is an infinitesimal of higher order than $\epsilon$), this is called a
focal direction. We shall now show that in general there exist
n − 1 focal directions, all or some of which may be imaginary,
coincident, or indeterminate; we shall then point out an important
particular case in which these directions coincide with those
of the canonical system.

Let PP' then be a focal direction; there will be a point C
(which may be at an infinite distance) common to r and r'.
Denote the length CP by $\dfrac{1}{\omega}$ (so that we shall have the particular
case of the rays being parallel at the limit when $\omega = 0$), and
let us take as axes of reference n orthogonal Cartesian axes
$y_\nu$ ($\nu$ = 1, 2, ..., n). Let $\lambda_{n|\nu}$ be the cosines of the direction n (i.e.
its parameters or moments, since in Euclidean space $\lambda_n^\nu = \lambda_{n|\nu}$).
The projection on the axis $y_\nu$ of the segment CP will then be
given by $\dfrac{1}{\omega}\lambda_{n|\nu}$, and that of CP' will be

$$\frac{1}{\omega}\,\lambda_{n|\nu} + d\Big( \frac{1}{\omega}\,\lambda_{n|\nu} \Big),$$

while the projection of PP' is $dy_\nu$. If then we express this last
term as the difference of the other two (PP' being the third side
of the triangle CPP'), we have

$$dy_\nu = \lambda_{n|\nu}\,d\frac{1}{\omega} + \frac{1}{\omega}\,d\lambda_{n|\nu}.$$
We now wish to use the methods of the absolute calculus.
We shall therefore associate with the given congruence n − 1
other congruences, orthogonal to it and to each other, which we
shall distinguish by the indices 1, 2, ..., n − 1. In addition to
the projections of PP' on the axes we require also its projections
on the set of n congruences so defined; for this we must multiply
the last equation by $\lambda_h^\nu$ (h = 1, 2, ..., n) and sum with respect
to $\nu$. First, let h = n; the projection of PP' on the direction
n is zero, as PP' by hypothesis belongs to the hyperplane X;
hence the left-hand side is zero. Further, in consequence of the
identity

$$\sum_{\nu=1}^{n} \lambda_n^\nu\,\lambda_{n|\nu} = 1,$$

it follows that

$$\sum_{\nu=1}^{n} \lambda_n^\nu\,d\lambda_{n|\nu} = 0;$$

hence finally we get

$$d\frac{1}{\omega} = 0,$$

which expresses the a priori evident fact that CP = CP'
(of course neglecting infinitesimals of higher order than the first).
Putting h in turn equal to 1, 2, ..., n − 1, and denoting by $\epsilon_h$
the projection of PP' on the direction h, we find

$$\epsilon_h = \frac{1}{\omega} \sum_{\nu=1}^{n} \lambda_h^\nu\,d\lambda_{n|\nu} \qquad (h = 1, 2, \dots, n-1).$$

We shall now expand $d\lambda_{n|\nu}$, remembering that since Christoffel's
symbols are all zero, we can replace the ordinary by the
covariant derivatives, and also that since $\epsilon_1, \epsilon_2, \dots, \epsilon_{n-1}$ are the
projections of PP' on the directions of the set of congruences,
and $dy_k$ (k = 1, 2, ..., n) its projections on the axes, we therefore
have

$$dy_k = \sum_{j=1}^{n-1} \epsilon_j\,\lambda_j^k.$$

The last formula thus becomes

$$\epsilon_h = \frac{1}{\omega} \sum_{j=1}^{n-1} \epsilon_j \sum_{\nu,k=1}^{n} \lambda_{n|\nu k}\,\lambda_h^\nu\,\lambda_j^k,$$

or

$$\omega\,\epsilon_h = \sum_{j=1}^{n-1} \epsilon_j \sum_{\nu,k=1}^{n} \lambda_{n|\nu k}\,\lambda_h^\nu\,\lambda_j^k.$$

Remembering the definition of the $\gamma$'s we have the system
of n − 1 equations

$$\omega\,\epsilon_h = \sum_{j=1}^{n-1} \gamma_{nhj}\,\epsilon_j \qquad (h = 1, 2, \dots, n-1), \qquad (25)$$
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which we can also write 

S- {y,^j - 8i<o)€i = 0 (ft = 1, 2, ... w - 1). (26') 

1 

This linear homogeneous system must determine the focal
directions $PP'$ (if they exist) in the hyperplane X, by giving
their projections $e_1, e_2, \ldots e_{n-1}$ on the orthogonal directions
$1, 2, \ldots n - 1$ which we have associated with the ray $r$.

The necessary and sufficient condition that the system (25')
may have solutions $e$ which are not all zero, is that the determinant
of the coefficients should vanish, i.e. that $\omega$ should satisfy
the equation of degree $n - 1$

$$\| \gamma_{hnj} - \delta_h^j \omega \| = 0 \qquad (h, j = 1, 2, \ldots n - 1). \qquad (26)$$

To every root $\omega$ corresponds at least one set of values of the
$e$’s, i.e. at least one focal direction $PP'$. Hence in general there
are $n - 1$ of these directions, which, however, like the corresponding
roots of (26), may be real or imaginary, distinct or coincident,
or (in the case of multiple roots) may be capable of having an
infinite number of determinations.

In fact, the properties of the secular equation, as noted in the
preceding section, hold for symmetrical determinants of the type
(24), while the left-hand side of (26) is not in general of this
form. There is, however, an important category of congruences
with this characteristic, which we shall now consider.

Normal congruences. If our congruence ($n$) is normal,
then by (20)

$$\gamma_{nhj} = \gamma_{njh} \qquad (h, j = 1, 2, \ldots n - 1).$$

We can therefore substitute $-\frac{1}{2}(\gamma_{nhj} + \gamma_{njh})$ for $\gamma_{hnj}$, and (26)
at once becomes identical with (24), which defines the canonical
directions. It follows that the canonical and focal directions
coincide. Hence on the one hand we have the geometrical interpretation
of the canonical directions; and on the other, from
the properties noted at the end of the preceding section, we have
the property that the focal directions are always real, and are
in general determinate and mutually orthogonal; and further,
that in the case of indeterminateness, when there is an infinite
number of them, it is always possible (and in an infinite number
of ways) to choose $n - 1$ of them which shall be mutually orthogonal.
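For $n = 3$ the determinantal equation for $\omega$ is a quadratic, so the contrast between the general and the normal case can be checked numerically. The following is only a sketch and is not taken from the text: the 2×2 matrix `g` merely stands in for the coefficients of the focal system, and the names `focal_roots` and `symmetric_part` are hypothetical.

```python
import math

def focal_roots(g):
    """Real roots of det(g - w*I) = 0 for a 2x2 matrix g, or None if they are imaginary."""
    (a, b), (c, d) = g
    trace = a + d
    det = a * d - b * c
    disc = trace * trace - 4.0 * det
    if disc < 0:
        return None  # complex conjugate roots: no real focal directions
    root = math.sqrt(disc)
    return ((trace - root) / 2.0, (trace + root) / 2.0)

def symmetric_part(g):
    """Symmetrized matrix; its discriminant (a - d)^2 + 4*m^2 is never negative."""
    (a, b), (c, d) = g
    m = (b + c) / 2.0
    return [[a, m], [m, d]]

general = [[1.0, 3.0], [-1.0, 2.0]]         # illustrative non-symmetric coefficients
print(focal_roots(general))                  # imaginary roots are reported as None
print(focal_roots(symmetric_part(general)))  # the symmetric case always has real roots
```

The non-symmetric example has no real roots, whereas its symmetrized counterpart does, in line with the statement that the focal directions of a normal congruence are always real.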

As we are dealing with a normal congruence, there exists 
(by definition) a family of surfaces 

$$f(x_1, x_2, \ldots x_n) = \text{constant},$$

which are cut orthogonally by the straight lines of the con- 
gruence; these lines therefore constitute the common normals 
to all the surfaces of the family. If we fix one of these surfaces, 
and associate with every point on it the n — 1 focal directions, 
we shall get n — 1 mutually orthogonal congruences of lines on 
the surface. These lines are called lines of curvature, by an obvious
generalization from the lines so determined in the case of surfaces
in ordinary space ($n = 3$). In fact, given such a surface,
say $\sigma$, the normals to it form a normal congruence (since they cut
$\sigma$ and the surfaces parallel to $\sigma$ orthogonally); and if we consider
the two focal directions at every point of $\sigma$ we arrive at precisely
the ordinary definition of the lines of curvature as those lines of
$\sigma$ along which the normals to $\sigma$ generate a developable ruled
surface.

General case. If the congruence ($n$) under consideration is
not normal, then in general, as we have seen, the focal and 
canonical directions at a generic point P of a ray r do not coincide. 
In order to find an interpretation of the canonical directions in 
this case, we should therefore have to examine in greater detail 
the behaviour of the rays of the congruence which are infinitely 
near r. 

For $n = 3$ there is a classical discussion by Kummer,¹ giving
a very illuminating interpretation of the canonical directions,²
and pointing out in particular that the directions which bisect 
the angles between the canonical directions also bisect the angles 
between the focal directions (when the latter are real). 

We shall leave the question at this point, merely pointing 
out to the reader the possibility of analogous interpretations for 
n > 3. 

¹ See e.g. Bianchi: Lezioni di Geometria Differenziale, Vol. I (third edition;
Bologna, Zanichelli, 1922), Ch. X.

² Cf. T. Levi-Civita: “Sulle congruenze di curve”, in Rend. della R. Acc. dei
Lincei, Vol. VIII (first half-year, 1899), pp. 239-46.

Physical Applications


CHAPTER XI

Evolution of Mechanics and Geometrical Optics;
Their Relation to a Four-dimensional World
according to Einstein


1. Hamilton’s principle for a free particle. 

We start from the equations of motion of a material particle 
in a conservative field. Let $U$ be the potential for unit mass.
The equations of motion, in Cartesian co-ordinates (referred
to fixed axes) $y_1, y_2, y_3$, are

$$\ddot{y}_i = \frac{\partial U}{\partial y_i} \qquad (i = 1, 2, 3), \qquad (1)$$

where as usual dots represent differentiation with respect to the
time $t$. If we denote the square of the line element described by
the moving particle in the small interval of time $dt$ by

$$dl_0^2 = \sum_{i=1}^{3} dy_i^2,$$

and if $v$ is the velocity of the particle (in absolute value), then

$$v^2 = \frac{dl_0^2}{dt^2} = \sum_{i=1}^{3} \dot{y}_i^2.$$

Putting

$$L = \frac{1}{2} v^2 + U,$$

it is known that the equations (1) can be summed up in the
equation of variation

$$\delta \int L\, dt = 0 \qquad (2)$$

which expresses Hamilton’s principle.

Let us fix our attention for a moment on (2). It implies an
interval of integration ($t_0$, $t_1$) fixed arbitrarily in advance; and
the vanishing of the left-hand side of (2) for variations of
the $y$’s, zero at the extremities but otherwise arbitrary, is equivalent
to the equations (1) being satisfied in the same interval.

This case, in which $t$ does not vary (i.e. $\delta t = 0$), is the
simplest application of Hamilton’s principle. Various generalizations,
however, in which $t$ also varies, either freely or subject to
certain conditions, have become classical. We shall shortly have
occasion to discuss one of these generalizations which concern
the equivalence between the equations (1) and (2). Meanwhile
we may note that if the co-ordinates are changed in any
way, so that the Cartesians $y_1, y_2, y_3$ are replaced by any
set of three curvilinear co-ordinates, or more generally by three
Lagrangian parameters $x_1, x_2, x_3$ connected with $y_1, y_2, y_3$ by
relations which may involve the time and which are regular and
reversible in the field considered, namely,

$$(T_3): \qquad x_h = x_h(y_1, y_2, y_3, t) \qquad (h = 1, 2, 3),$$

or, solving with respect to $y_i$ ($i = 1, 2, 3$),

$$(T_3'): \qquad y_i = y_i(x_1, x_2, x_3, t) \qquad (i = 1, 2, 3);$$

then if we insert these expressions in $L$, it becomes a function
$L(x \mid \dot{x} \mid t)$ of the arguments $x_h$, $\dot{x}_h$ ($h = 1, 2, 3$), $t$, quadratic
(in general not homogeneous) in the $\dot{x}$’s.

As we propose to consider $L$ as an invariant, it follows that
(2) will hold for the Lagrangian parameters $x$, and we have only
to find its explicit form. Calculating the variation and integrating
by parts in the usual way, we easily find

$$\delta \int L\, dt = -\int \sum_{h=1}^{3} T_h\, \delta x_h\, dt, \qquad (3)$$

where for shortness we have put

$$T_h = \frac{d}{dt}\frac{\partial L}{\partial \dot{x}_h} - \frac{\partial L}{\partial x_h}$$

(Lagrangian binomials). The dynamical equations then take the
form

$$T_h = 0 \qquad (h = 1, 2, 3) \qquad (4)$$

(known as Lagrange’s form); and it is to be noted that, in virtue
of the invariance of the left-hand side of (3), the quantities $T_h$
constitute a covariant tensor, as pointed out in a similar case
in Chapter V, p. 110. It follows that the equations (4), i.e.

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_h} - \frac{\partial L}{\partial x_h} = 0 \qquad (h = 1, 2, 3), \qquad (4')$$

are invariant (cf. Chapter V, p. 110) with respect to the transformations
($T_3$) which leave $L$ invariant.


2. Time as a fourth co-ordinate. Space-time. World lines.

An obvious consequence of Lagrange’s equations (4') is the
identity

$$\frac{d}{dt}\left[\sum_{h=1}^{3} \frac{\partial L}{\partial \dot{x}_h}\, \dot{x}_h - L\right] + \frac{\partial L}{\partial t} = 0.$$

Now suppose that in the interval ($t_0$, $t_1$) the independent
variable $t$ is also made to undergo a variation $\delta t$ which is zero
at the extremities and is otherwise arbitrary. Since the $x_h$ are
unchanged by this, while the derivatives $\dot{x}_h = \frac{dx_h}{dt}$ undergo the
increments

$$-\dot{x}_h \frac{d\,\delta t}{dt},$$

it will at once be seen that, by an obvious integration by parts,
the contribution of the variation of $t$ to $\delta \int L\, dt$, namely,

$$\int_{t_0}^{t_1} \left\{ \frac{\partial L}{\partial t}\, \delta t - \left[\sum_{h=1}^{3} \frac{\partial L}{\partial \dot{x}_h}\, \dot{x}_h - L\right] \frac{d\,\delta t}{dt} \right\} dt,$$

can be put in the form

$$\int_{t_0}^{t_1} \left\{ \frac{d}{dt}\left[\sum_{h=1}^{3} \frac{\partial L}{\partial \dot{x}_h}\, \dot{x}_h - L\right] + \frac{\partial L}{\partial t} \right\} \delta t\, dt,$$

which, as we have just pointed out, is zero in consequence of
(4').

It is therefore possible, in dealing with the Hamiltonian
equation (2), to apply exactly the same treatment to the space
co-ordinates $x_1, x_2, x_3$ and the time $t$.

To simplify the argument, consider the four-dimensional
manifold $V_4$ corresponding to four parameters $x_1, x_2, x_3, t$; the manifold,
in which space and time are simultaneously represented,
may be called space-time.

A set of three equations

$$x_i = x_i(t) \qquad (i = 1, 2, 3),$$

or, in terms of kinematics, a motion, corresponds to a curve
belonging to $V_4$, and reciprocally. Such a curve is called a world
line; it is an obvious generalization of the plane diagram (in
which the abscissa is the time and the ordinate the space described)
used to represent the circumstances of motion in a given trajectory.
Adopting this expression, we can say that the integral curves of
the equations (4') are all those world lines of $V_4$, and only those,
for which the variation of the integral $\int L\, dt$ vanishes, the
extremities being fixed.

3. General transformations of co-ordinates in space-time. 
Simultaneity. 

The most general transformation of parameters in $V_4$ obviously
includes three equations of the type ($T_3$), which substitute
for the Cartesian co-ordinates $y_1, y_2, y_3$ three independent combinations
of them, $x_1, x_2, x_3$, also involving $t$; and a fourth
equation which substitutes for the time $t$ a further combination
$x_0(y_1, y_2, y_3, t)$ (independent of the three preceding equations).
This new parameter is sometimes called the local time, as it
depends not only on the original time, but also on the point in
question. A transformation ($T_4$) is thus represented by the
formula:

$$(T_4): \qquad x_i = x_i(y_1, y_2, y_3, t) \qquad (i = 0, 1, 2, 3).$$

An obvious but important property of such a transformation
is the following. If two events are characterized by different
values of $y_1, y_2, y_3$, but the same value of $t$, it will in general
happen that after the transformation is effected, not only the
space co-ordinates $x_1, x_2, x_3$ of the two events will be different, but
also the time co-ordinates $x_0$. This implies that two events which
appear simultaneous with reference to the system $y_1, y_2, y_3, t$
are not in general simultaneous with reference to the system
of the $x$’s; simultaneity is therefore relative to the system of
reference. This evidently does not happen when the first of the
relations ($T_4$) is of the type $x_0 = x_0(t)$, or in particular $x_0 = t$,
so that the ($T_4$) reduces to a ($T_3$). And it is precisely in order
to avoid any conflict with the intuitive concept of (absolute)
simultaneity that only transformations of the type ($T_3$) are
considered in the classical physics. But a more acute criticism
of this intuitive concept shows that, far from being a logical
necessity, it has an empirical origin based on experimental
results which can only be taken as a first approximation; it is
therefore reasonable, in view of the speculative nature of our
considerations, to admit the possibility of a more general conception
of simultaneity.
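The dependence of simultaneity on the system of reference can be made concrete with a small sketch. The particular transformation used here, with local time $x_0 = t + k y_1$, is purely hypothetical and chosen only for illustration; the text does not single out any specific ($T_4$), and the names `t4`, `t3` and the constant `k` are assumptions of this example.

```python
def t4(y1, y2, y3, t, k=0.5):
    """Hypothetical (T4): local time x0 = t + k*y1, space co-ordinates left unchanged."""
    return (t + k * y1, y1, y2, y3)

def t3(y1, y2, y3, t):
    """A transformation with x0 = t: the time co-ordinate does not involve the place."""
    return (t, y1, y2, y3)

# Two events at different places but with the same value of t.
event_a = (0.0, 0.0, 0.0, 1.0)   # (y1, y2, y3, t)
event_b = (2.0, 0.0, 0.0, 1.0)

x0_a = t4(*event_a)[0]
x0_b = t4(*event_b)[0]
print(x0_a, x0_b)                                # the local times differ
print(t3(*event_a)[0] == t3(*event_b)[0])        # still simultaneous when x0 = t
```

Events that share the same $t$ acquire different values of $x_0$ as soon as the local time involves the position, which is all that the relativity of simultaneity asserts at this stage.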

4. Einstein’s form for Hamilton’s principle. Its invariant 
character under any transformation of co-ordinates. 

So long as $L$ is taken to be invariant, the form of the integral
$\int L\, dt$ is evidently not invariant for a transformation ($T_4$), since
in general $dt$ is replaced by an expression linear in all four
differentials $dx$. We might try to replace the base $L$ by something
more general; it would then be possible to reach the required
result, but the method would be complicated and infertile, and
the loss in simplicity both of concept and of form would be
much greater than the gain in generality.

But it is not difficult to arrive at a significant form which
shall be invariant for every ($T_4$) if we regard Hamilton’s principle
as an approximate result, the degree of approximation being of
course so high that in ordinary applications, astronomical as
well as technical, the difference between it and the rigorous
hypothetical principle shall be imperceptible. This will evidently
be the case if the order of magnitude of the difference between
the two, with respect to the values given by the ordinary theory,
is not higher than the hundred-millionth ($10^{-8}$).

A concrete application of this criterion is as follows. Let $c$
denote a constant velocity, large in comparison with the greatest
velocity attained in the motions we propose to discuss. We
shall consider quantities comparable with $\beta = \frac{v}{c}$ as small quantities
of the first order, and we shall consider quantities of the
second and higher orders as negligible in comparison with unity;
we shall also suppose that the ratio $\frac{U}{c^2}$ is similarly negligible.


We note that this will in fact be the case if $c$ is comparable
with the velocity of light, not only for ordinary problems of
terrestrial motion, but also in celestial mechanics. In order to
see this, we need only suppose that $v$ is a planetary velocity and
$U$ the Newtonian potential which determines it, so that by a
well-known result $U$ (in the field of motion of the planet) is of
the same order of magnitude as $v^2$.

We may take 30 kilometres per second, corresponding to the
earth’s motion in its orbit, as the order of magnitude of $v$. In
round numbers, $c$ = 300,000 km./sec., so that we have $\frac{v}{c} = 10^{-4}$
(approx.), and therefore $\frac{v^2}{c^2}$ and $\frac{U}{c^2} = 10^{-8}$ (approx.).
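These orders of magnitude are easily checked. The sketch below uses the round values quoted in the text for $v$ and $c$; the solar gravitational parameter and the mean radius of the earth’s orbit are standard modern values supplied here as assumptions, and are not taken from the text.

```python
import math

v = 30.0          # km/s, earth's orbital velocity (round value quoted in the text)
c = 300_000.0     # km/s, round value quoted in the text

beta = v / c
beta_squared = beta ** 2

# Assumed modern values, not from the text:
GM_SUN = 1.327e11     # km^3/s^2, gravitational parameter of the sun
R_ORBIT = 1.496e8     # km, mean radius of the earth's orbit

U = GM_SUN / R_ORBIT            # Newtonian potential per unit mass, km^2/s^2
potential_ratio = U / c ** 2

print(f"v/c     = {beta:.2e}")
print(f"v^2/c^2 = {beta_squared:.2e}")
print(f"U/c^2   = {potential_ratio:.2e}")
```

Both $v^2/c^2$ and $U/c^2$ come out at about one hundred-millionth, and $U$ is indeed of the same order as $v^2$.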


We shall see farther on, in §§ 8, 0, and 16, that physical con- 
siderations lead us to take for c precisely the velocity of light. 
Since $\delta t$ must vanish at the limits of integration, we have

$$\delta \int c^2\, dt = 0,$$

so that $L$ can be replaced, as the integrand of (2), by

$$c^2 - L = c^2 \left(1 - \frac{L}{c^2}\right).$$

The terms $\frac{L}{c^2}$, though negligible in comparison with
unity, are essential in order to prevent the equation of variation
from reducing to an identity. Terms of higher order may however
be neglected. We may therefore write

$$c^2 - L = c^2 \sqrt{1 - \frac{2L}{c^2}} = c \sqrt{c^2 - v^2 - 2U},$$

so that, omitting the constant factor $c$ and writing $\frac{dl_0^2}{dt^2}$ instead of
$v^2$, the equation of Hamilton’s principle (which, as just pointed
out, is equivalent to $\delta \int (c^2 - L)\, dt = 0$) can be replaced by

$$\delta \int \sqrt{c^2 - 2U - \frac{dl_0^2}{dt^2}}\; dt = 0,$$

or, putting

$$ds^2 = (c^2 - 2U)\, dt^2 - dl_0^2, \qquad (5)$$

by

$$\delta \int ds = 0. \qquad (6)$$
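How little the integrand is altered by passing from $c^2 - L$ to the radical can be seen numerically. The following sketch assumes the planetary values used above ($v$ = 30 km/s, $c$ = 300,000 km/s) and takes $U = v^2$ as a representative order of magnitude; that choice, like the function names, is an assumption of the example. Exact decimal arithmetic is used because the difference lies below ordinary floating-point precision.

```python
from decimal import Decimal, getcontext

getcontext().prec = 50   # enough digits to resolve a relative difference near 1e-16

def classical_integrand(c, v, U):
    """c^2 - L, with L = v^2/2 + U per unit mass."""
    return c * c - (v * v / 2 + U)

def einstein_integrand(c, v, U):
    """c * sqrt(c^2 - v^2 - 2U), i.e. c times ds/dt."""
    return c * (c * c - v * v - 2 * U).sqrt()

c = Decimal(300000)   # km/s
v = Decimal(30)       # km/s
U = v * v             # assumed: U of the same order of magnitude as v^2

a = classical_integrand(c, v, U)
b = einstein_integrand(c, v, U)
rel = (a - b) / a
print(rel)
```

The relative difference is of the order of the square of $L/c^2$, far below the hundred-millionth which the text accepts as imperceptible.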


Since the value of $dl_0^2$ referred to Cartesian co-ordinates is
$\sum_{i=1}^{3} dy_i^2$, the $ds^2$ just introduced is a quaternary differential
quadratic form; it is indefinite, since for real and infinitesimal
values of $dt, dy_1, dy_2, dy_3$ it can have both positive and negative
values. At the same time it is to be remembered that, for the
phenomena of motion at present under consideration, we have always
$ds^2 > 0$.

To show that this is so, note that, taking out the common
factor $c^2 dt^2$ and again replacing $\frac{dl_0^2}{dt^2}$ by $v^2$, we can write

$$ds^2 = c^2 dt^2 \left[1 - \frac{2U}{c^2} - \frac{v^2}{c^2}\right];$$

this proves the assertion, since the quantity in brackets is certainly
positive when the quantitative relations stipulated at the
beginning of our argument hold.

We may now note that if the $ds^2$ expressed by (5) is considered
as the square of the line element of the manifold $V_4$
(which contains both space and time), then (6) represents the
characteristic equation of geodesics of $V_4$ (cf. Chapter V, p. 130).
It is true that the metric of this manifold is characterized by an
indefinite quadratic form, but, as was pointed out on p. 142 of
Chapter V, this does not introduce any real complication so long
as we limit our considerations to lines wholly constituted of
elements for which $ds^2 > 0$, as it is in the present case. We can
therefore say that the proposed modification of Hamilton’s
principle imposes a metric limitation on the space-time manifold
$V_4$, and that the mechanical problem of the motion of a free
particle under the action of forces derived from a potential has
been transformed (with an alteration of the laws of dynamics
which is quantitatively very small) into the purely geometrical
problem of the determination of the geodesics of a certain four-dimensional
metric manifold.

If for the arguments $t, y_1, y_2, y_3$ we substitute any four independent
combinations of them whatever, $x_0, x_1, x_2, x_3$, by means
of a substitution ($T_4$), $ds^2$ will lose the special form (5) and assume
the general type of a quaternary quadratic,

$$ds^2 = \sum_{i,k=0}^{3} g_{ik}\, dx_i\, dx_k, \qquad (6')$$

whose ten co-efficients $g_{ik}$ will naturally be, in general,
functions of the $x$’s.

The essential point is that, $ds^2$ being invariant, (6) is also
invariant for any choice whatever of co-ordinates in $V_4$. This
constitutes a marked superiority of (6) over the original form of
Hamilton’s principle. From the conceptional point of view it is
also to be noted that this change realizes Einstein’s fundamental
concept of general relativity, which requires that it shall be possible
to express the laws of any physical phenomenon whatever in
a form which is invariant for every possible choice of co-ordinates,
both of space and of time, without the time having to hold the
privileged position assigned to it in the classical theories.

5. Mass and energy: views suggested by the modification of
the dynamical law.

We shall examine in detail the form taken by the dynamical
equations of the free material particle when the classical Lagrangian
function $L$ is replaced by the function

$$L^* = -c \sqrt{c^2 - v^2 - 2U},$$

which we shall write briefly in the form

$$L^* = -c^2 K,$$

putting

$$K = \sqrt{1 - \frac{v^2 + 2U}{c^2}}.$$

Substituting $-c^2 K$ for $L$ in Lagrange’s equations, they
become

$$\frac{d}{dt}\frac{\partial K}{\partial \dot{y}_i} - \frac{\partial K}{\partial y_i} = 0 \qquad (i = 1, 2, 3),$$

or since

$$\frac{\partial K}{\partial \dot{y}_i} = -\frac{\dot{y}_i}{c^2 K}, \qquad \frac{\partial K}{\partial y_i} = -\frac{1}{c^2 K}\frac{\partial U}{\partial y_i},$$

we have also

$$\frac{d}{dt}\frac{\dot{y}_i}{K} = \frac{1}{K}\frac{\partial U}{\partial y_i} \qquad (i = 1, 2, 3). \qquad (7)$$

Remembering that $K$ differs by very little from 1, we see
that quantitatively these equations differ by very little from
the equations (1). Considering them from the point of view of
form, and comparing them with the cardinal equation of classical
dynamics,

$$\frac{d\mathbf{Q}}{dt} = \mathbf{F}$$

(where $\mathbf{Q}$ is the momentum and $\mathbf{F}$ the force), we see that the
momentum per unit mass of the old theory is replaced in the
new by the vector whose components are $\frac{\dot{y}_i}{K}$. For a particle of
mass $m_0$ and velocity $\mathbf{V}$ the vectorial expression for the momentum
will therefore be

$$\mathbf{Q} = \frac{m_0 \mathbf{V}}{K}.$$

If we wish to retain the formal property that the momentum
is the product of the mass by the velocity, we must take as the
mass not the constant $m_0$, an intrinsic property of the body in
motion, but the quantity

$$m = \frac{m_0}{K},$$

which will be seen to depend on the velocity and the field of
force. Neglecting the latter so as to fix the attention in the first
place on the motion as it depends on the velocity, we reach the
expression

$$m = \frac{m_0}{\sqrt{1 - \frac{v^2}{c^2}}},$$

from which it appears that $m$ increases as the velocity increases
and would tend to infinity if the velocity could reach the value
$c$. In this sense we say that the typical velocity $c$, introduced
to give an invariant form to Hamilton’s principle, is a limiting
velocity.
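The growth of $m$ with the velocity can be tabulated directly from the last expression. This is a minimal sketch; the function name `relativistic_mass` and the default value of `c` (the modern value of the velocity of light in km/s) are assumptions of the example rather than anything fixed by the text.

```python
import math

def relativistic_mass(m0, v, c=299_792.458):
    """m = m0 / sqrt(1 - v^2/c^2), with v and c in the same units (km/s by default)."""
    if abs(v) >= c:
        raise ValueError("the expression requires |v| < c")
    return m0 / math.sqrt(1.0 - (v / c) ** 2)

# Velocities given as fractions of c (so c is set to 1 here).
for fraction in (0.0, 0.1, 0.6, 0.9, 0.99, 0.999):
    print(fraction, relativistic_mass(1.0, fraction, c=1.0))
```

The ratio $m/m_0$ stays very close to 1 for small velocities and grows without limit as $v$ approaches $c$; the function refuses $|v| \geq c$, where the radical ceases to be real and positive.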

We now proceed to examine the concept of energy in the 
light of relativity mechanics. 

In the classical mechanics, given a generic Lagrangian function
$L(y \mid \dot{y})$ (where $L$ does not explicitly contain the time $t$),
the corresponding expression for the energy is

$$H = \sum_{i=1}^{3} \frac{\partial L}{\partial \dot{y}_i}\, \dot{y}_i - L; \qquad (8)$$

in the case where $L$ can be broken up into a part $T(y \mid \dot{y})$ homogeneous
of the second degree in the $\dot{y}$’s, and a part $U$ independent
of them, this becomes, by Euler’s theorem,

$$H = T(y \mid \dot{y}) - U(y).$$

It is known that $T$ can be interpreted as the kinetic and
$-U$ as the potential energy. Since we have replaced the classical
$L$ by the expression

$$L^* = -c \sqrt{c^2 - v^2 - 2U} = -c^2 K,$$

we must now determine the new expression $H^*$ for the energy
per unit mass.

Applying (8) we get

$$H^* = \sum_{i=1}^{3} \frac{\partial L^*}{\partial \dot{y}_i}\, \dot{y}_i - L^*;$$

substituting from equations (7) and using the expression for $K$
we get finally

$$H^* = \frac{c^2 - 2U}{K} = \frac{c^2 - 2U}{\sqrt{1 - \frac{v^2 + 2U}{c^2}}}. \qquad (9)$$

We see therefore that the energy cannot be divided into a
part due to motion and a part due to position. Further, for
$v = U = 0$, the energy does not vanish, but remains equal
to $c^2$: a remarkable fact, the interpretation of which will be
seen in a moment.

Expanding the radical in series we can write

$$\frac{1}{K} = 1 + \frac{1}{2}\, \frac{v^2 + 2U}{c^2} + \ldots,$$

and therefore, retaining only terms of the second order,

$$H^* = c^2 + \frac{1}{2} v^2 - U.$$

To this degree of approximation, therefore, the energy is
composed of a kinetic part expressed as usual by $\frac{1}{2} v^2$, a part due
to position which is still given by $-U$, and in addition a constant
part (i.e. a part independent of both position and velocity)
equal to $c^2$; this last part is called the intrinsic energy of unit
mass. A material particle of mass $m_0$ (at rest or moving under
no forces) will thus have intrinsic energy $m_0 c^2$. Now considerations
of a different nature lead us to assign to this intrinsic
energy a much more profound significance than that of a mere
additive constant of conventional value; it is in fact taken to
represent the effective atomic and molecular energy stored up
in the body to the extent of 25 million kilowatt-hours for every
gramme of matter. The possibility of the existence of this enormous
quantity of latent energy is shown by phenomena of
radioactivity; a sufficient example is the fact that any small
mass of radium is capable of giving off for years and years,
without perceptible modification, enough heat to raise an equal
mass of water from 0° C. to boiling-point in every hour. The
supply of heat would last for a very long time; more than 2500
years for radium, and for other radioactive elements a period
comparable with geological epochs. While radioactivity is not
a general property of all bodies, yet it demonstrates the fact that
(at least in certain cases) matter contains an enormous store of
energy, and in this form the assertion can be generalized so as
to extend to every atom of ponderable matter.
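Both the second-order expression for the energy and the figure of 25 million kilowatt-hours per gramme can be verified by direct arithmetic. The sketch below works in SI units and uses the modern value of $c$; the sample values of $v$ and $U$ (a planetary velocity, with $U$ equal to $v^2$) and all function names are assumptions of the example.

```python
import math

C = 299_792_458.0   # m/s, modern value of the velocity of light

def energy_exact(v, U, c=C):
    """Energy per unit mass, (c^2 - 2U) / sqrt(1 - (v^2 + 2U)/c^2)."""
    return (c ** 2 - 2.0 * U) / math.sqrt(1.0 - (v ** 2 + 2.0 * U) / c ** 2)

def energy_second_order(v, U, c=C):
    """Approximation c^2 + v^2/2 - U, retaining terms of the second order only."""
    return c ** 2 + 0.5 * v ** 2 - U

def intrinsic_energy_kwh(mass_kg, c=C):
    """Intrinsic energy m0*c^2 converted from joules to kilowatt-hours."""
    return mass_kg * c ** 2 / 3.6e6

v = 3.0e4        # m/s, planetary order of magnitude
U = v ** 2       # m^2/s^2, assumed of the same order as v^2

print(energy_exact(0.0, 0.0) == C ** 2)              # energy at rest, with U = 0
print(energy_exact(v, U), energy_second_order(v, U))
print(intrinsic_energy_kwh(1.0e-3))                  # one gramme of matter
```

For one gramme the intrinsic energy comes to roughly 2.5 × 10⁷ kilowatt-hours, in agreement with the figure quoted above, and the exact and approximate energies are indistinguishable at planetary velocities.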

Admitting the possibility of the existence of this intrinsic
energy, the foregoing considerations result in our assigning
to it the value $m_0 c^2$. If instead we return to the expression (9)
for the total energy, and suppose that the potential $U$ is zero,
we find for the total energy (kinetic and intrinsic) localized in
a body whose mass when at rest is $m_0$, the expression

$$E = \frac{m_0 c^2}{\sqrt{1 - \frac{v^2}{c^2}}},$$

and remembering the expression for the mass $m$ as a function
of the velocity we can also write this as

$$E = mc^2. \qquad (10)$$

This result shows us that there is a proportional relation not
only between the mass of the body when at rest and the intrinsic
energy, but, more generally, between the mass and the total
energy localized in the body. It also suggests the hypothesis that
to any form of energy there must be assigned a mass connected
with it by the relation (10); and, vice versa, that every mass $m$
corresponds to a quantity of energy $mc^2$. This hypothesis is
supported by other considerations, and leads to the view, of
primary philosophical importance, that energy and matter may
be considered as different manifestations of one single entity,
which appears as ordinary matter when it is, so to speak, sufficiently
concentrated, while it appears as energy in widely different
forms when there are no condensation nuclei present.

6. Einstein’s form for the principle of inertia. Restricted 
relativity. 

The equations of motion in the original Newtonian form (1)
imply, as is well known, a state of uniform motion when the forces
are zero or, which comes to the same thing (except for a non-essential
constant), for $U = 0$. Equation (2), which is rigorously
equivalent to (1), therefore defines states of uniform motion for
$U = 0$. This property also holds for the new Einsteinian form
(6) of Hamilton’s principle, though it is not rigorously equivalent
to equations (2). Before proving this we may point out that, for
$U = 0$, (5) gives

$$ds_0^2 = c^2 dt^2 - dl_0^2. \qquad (11)$$

By a mere change of the unit of measurement of time (the
advantage of which will be seen shortly), i.e. by putting $ct = y_0$,
this quadratic form becomes

$$ds_0^2 = dy_0^2 - dl_0^2,$$

and referring the space to orthogonal Cartesian co-ordinates,

$$ds_0^2 = dy_0^2 - dy_1^2 - dy_2^2 - dy_3^2. \qquad (11')$$

This is analogous to the ordinary expression for the $ds^2$ of
a Euclidean $V_4$ in orthogonal Cartesian co-ordinates, except for
the signs of the co-efficients, which make it indefinite; in this
case the index of inertia¹ is 3. Hence the $V_4$ with a metric of
this kind is called pseudo-Euclidean; the system of co-ordinates
$y_0, y_1, y_2, y_3$, which gives this form to $ds_0^2$, is called pseudo-Cartesian
or Galilean.

It will sometimes be convenient to put the expression for a
pseudo-Euclidean $ds^2$ back into the general form (6'); for this
purpose we introduce the symbols

$$\epsilon_{ik} = \begin{cases} 1 & \text{for } i = k = 0, \\ -1 & \text{for } i = k \neq 0, \\ 0 & \text{for } i \neq k \end{cases}$$

(the notation being similar to that introduced in the note on
p. 55 of Chapter III). We can then say that in pseudo-Cartesian
co-ordinates the co-efficients of $ds^2$ are

$$g_{ik} = \epsilon_{ik}.$$

We then have also, as is easily verified,

$$g^{ik} = g_{ik} = \epsilon_{ik}. \qquad (12)$$

Returning to the property enunciated at the beginning of
this section, we note that, for $U = 0$, the expression for $L^*$
becomes

$$L_0 = -c \sqrt{c^2 - v^2},$$

and that (6), which becomes

$$\delta \int ds_0 = 0, \qquad (6')$$

can be written

$$\delta \int L_0\, dt = 0.$$

¹ The index of inertia is the number of negative coefficients of a quadratic
form when expressed (in any way) in the canonical form (i.e. so that it contains no
product terms).

The corresponding Lagrangian equations, from the fact that
$L_0$ does not depend explicitly on the $y$’s, at once give the three
first integrals

$$\frac{\partial L_0}{\partial \dot{y}_i} = \text{constant} \qquad (i = 1, 2, 3),$$

whence there follows the constancy of all the $\dot{y}_i$’s (the principle
of inertia). Now consider a particular, but very important,
category of transformations specified as follows. From
the set of four co-ordinates ($t, y_1, y_2, y_3$) we pass to another set
($t', y_1', y_2', y_3'$) for which the form (11) of $ds_0^2$ remains unchanged,
this being understood in the sense that the transformation
formulae are to give identically

$$c^2 dt^2 - \sum_{i=1}^{3} dy_i^2 = c^2 dt'^2 - \sum_{i=1}^{3} dy_i'^2.$$

The equation (6') then ensures that in the new co-ordinates
also, interpreting $t'$ as the time and $y_1', y_2', y_3'$ as Cartesian co-ordinates,
the motion will appear uniform (restricted relativity).

Transformations of this kind were effectively constructed by
Lorentz, so that they may be called Lorentz transformations;
we shall denote them shortly by (A) and discuss them fully
in Section 8. Meanwhile we may indicate the characteristic
property, pointed out by Professor Marcolongo, that, if we put
$\sqrt{-1}\, ct = y_4$, so that $ds_0^2$ takes the Euclidean form $-\sum_{i=1}^{4} dy_i^2$,
a Lorentz transformation leaves unchanged the form $\sum_{i=1}^{4} dy_i^2$, and
thus (here too apart from any question of imaginaries) is substantially
identical with motion in a four-dimensional Euclidean
space.

To close these remarks on the effective existence of these
special transformations (A) we may note an important corollary.
Every (A), as we have said, transforms a generic uniform motion
into a new motion which is also uniform; but it is not possible
to assert that the velocity is unaltered by the transformation.
There is, however, at least one case in which this happens, namely,
motion in which the velocity is that very large constant velocity
$c$ which we originally introduced in order to modify Hamilton’s
formula in a way which should be quantitatively imperceptible,
but fertile in its results.

In fact, for a motion in which the velocity is $c$ (with respect
to the parameters $t, y_1, y_2, y_3$), we have obviously

$$c^2 = \sum_{i=1}^{3} \frac{dy_i^2}{dt^2},$$

and therefore

$$c^2 dt^2 - \sum_{i=1}^{3} dy_i^2 = 0.$$

In view of the invariance, not only of $ds_0^2$ but also of the
special form $c^2 dt^2 - \sum_{i=1}^{3} dy_i^2$ which we have given it, we have, on
passing to the new variables $t', y_1', y_2', y_3'$ by a Lorentz transformation,

$$c^2 dt'^2 - \sum_{i=1}^{3} dy_i'^2 = 0$$

for the transformed motion as well as for the original one, and
therefore the velocity is $c$.
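The corollary can be illustrated numerically. The explicit form of the transformation is not given in this section (the text defers it to Section 8), so the sketch below assumes the standard Lorentz transformation for relative velocity $u$ along the axis $y_1$; the function names and the choice of units with $c = 1$ are likewise assumptions of the example.

```python
import math

def boost(t, y1, u, c=1.0):
    """Standard Lorentz transformation along y1 with relative velocity u (|u| < c)."""
    gamma = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    return gamma * (t - u * y1 / c ** 2), gamma * (y1 - u * t)

def interval(t, y1, c=1.0):
    """The form c^2 t^2 - y1^2 for a displacement (t, y1) from the origin."""
    return c ** 2 * t ** 2 - y1 ** 2

def transformed_speed(v, u, c=1.0):
    """Velocity, in the new co-ordinates, of a uniform motion of velocity v along y1."""
    t_new, y_new = boost(1.0, v, u, c)   # point reached after unit time
    return y_new / t_new

print(interval(2.0, 1.0), interval(*boost(2.0, 1.0, 0.6)))   # the form is unchanged
print(transformed_speed(0.5, 0.6))    # an ordinary velocity is altered
print(transformed_speed(1.0, 0.6))    # the velocity c is reproduced
```

A uniform motion remains uniform but in general changes its velocity, whereas a motion with velocity $c$ keeps that velocity, as the vanishing of the quadratic form requires.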


7. The kinematics of rigid systems. Ordinary method of 
approach and possible variants. 

In the foregoing sections we have been led to modify (very
slightly in ordinary conditions) the dynamics of a material
particle $P$, i.e. the relation between the motion and the disturbing
force. Nothing however has been, or need be, modified as regards
the kinematics, i.e. the description of the phenomenon of the
change of position of a point $P$ with respect to an assigned
observer $\Sigma$, or in other terms with respect to a Cartesian system,
in a certain interval of time. For convenience (the reason for
this choice will be clear in a moment) we shall denote these axes
of reference by $\bar{O}\bar{y}_1\bar{y}_2\bar{y}_3$, and the time by $t$.
The equations of motion of $P$,

$$\bar{y}_i = \bar{y}_i(t) \qquad (i = 1, 2, 3), \qquad (13)$$

the velocity $\mathbf{V}$ as a vector of components $\dot{\bar{y}}_i$ ($i = 1, 2, 3$), the
acceleration, &c., will all be as in the ordinary case. In particular,
there is uniform motion when $\mathbf{V}$ is constant, i.e. the $\bar{y}_i$’s are
linear functions of $t$. In this case, taking one of the axes, say
that of the $\bar{y}_1$’s, parallel to $\mathbf{V}$, the equations (13) can be put in the
simplified form

$$\bar{y}_1 = \bar{y}_1^0 + vt, \qquad \bar{y}_2 = \bar{y}_2^0, \qquad \bar{y}_3 = \bar{y}_3^0, \qquad (14)$$

where $v$ obviously denotes the velocity in the scalar sense (the
component of $\mathbf{V}$ along $\bar{y}_1$) and $\bar{y}_1^0, \bar{y}_2^0, \bar{y}_3^0$ the initial values
of the co-ordinates $\bar{y}_1, \bar{y}_2, \bar{y}_3$ of the moving point.

As is well known, there are in ordinary kinematics two ways
of defining rigid motion and of investigating its problems; these
are briefly as follows:

(1) A rigid system is defined as a system consisting of any
number of points $P, P', \ldots$ of co-ordinates $\bar{y}_i, \bar{y}_i', \ldots$
($i = 1, 2, 3$), which move in such a way that their mutual
distances apart remain unchanged; i.e. so that for any two points
whatever of the system, $P$ and $P'$, and for any movement of
these points, the relation

$$\sum_{i=1}^{3} (\bar{y}_i - \bar{y}_i')^2 = \text{constant}$$

holds, the quantity on the right being constant (geometrical
characteristics of the moving system).

In these relations and in their differential consequences
are summed up all the properties concerning simultaneous
positions, velocities, &c., of the various points of the
system.

(2) The ground covered by the equations just given for the
relations between pairs of points is, so to speak, divided into
two parts, the first expressing the intrinsic circumstance (i.e.
independent of the system of reference $\Sigma$) that when the
moving system changes its position with respect to $\Sigma$ it keeps
its configuration unchanged. This is equivalent to the possibility
of placing at the points $P, P', \ldots$ an observer $S$
rigidly attached to the body, who can be represented as usual
by an orthogonal trihedron $O y_1 y_2 y_3$ with respect to which
the position of each separate point of the moving system remains
unchanged. In other words, the co-ordinates $y_i, y_i', \ldots$ of
these points with respect to these axes attached to the body do
not vary with the time.

At this stage the argument usually is as follows. In order
to determine the position, with respect to the original system of
reference $\Sigma$, of the whole moving system at a generic instant,
it is only necessary to place the trihedron
$\bar{O}\bar{y}_1\bar{y}_2\bar{y}_3$ (attached to the body) in its
proper position with respect to $Oy_1y_2y_3$.
Thus we again have to deal with transformation formulae (variable
from moment to moment) between two systems of orthogonal
Cartesian axes, and therefore of the type

$$y_i = \sum_{k=1}^{3} \alpha_{ik}\,\bar{y}_k + \varphi_i(t) \qquad (i = 1, 2, 3), \tag{15}$$

where $\alpha_{ik}$ denotes the cosine (variable with the time, if the
motion is not one of pure translation) of the angle between the
fixed axis $Oy_i$ and the moving axis $\bar{O}\bar{y}_k$, and
$\varphi_i$ is a function of the time
(linear if the motion reduces to a uniform translation).

The proposition in italics, or the equivalent group of formulae
(15), constitutes the complement of what may be called the
intrinsic rigidity of the body (the existence of the trihedron
attached to the body); the combination of the two gives us once
again the kinematics of a solid body in its classical form.

But if we analyse this complement a little further, we find
that we can modify to some extent the ordinary idea of the
motion of a solid body without giving up either intrinsic rigidity
or the validity of Euclidean geometry.

We need only introduce the hypothesis (independent of both
the geometry and the kinematics of the point) that the measures
of the distances between the points P, P', . . . (and therefore also
of angles) of our solid may differ according as they are made
by an observer attached to P, P', ... or by the fixed observer
$\Sigma$. While granting that the two observers may disagree as to
the measures, it is to be borne in mind that, by hypothesis, the
measurements are made by each of them in accordance with a
Euclidean metric, and that (as in the classical scheme of things)
the rigidity of the motion must always be respected, from the
point of view of the fixed observer $\Sigma$ as well as of the other.
This requires that every distance apart of two points P, P',
. . . of our system must remain unchanged in time, whether the
distance is calculated by $\Sigma$ or by the other. For this it is
necessary and sufficient that the transformation formulae between
the $y$'s and the $\bar{y}$'s,

$$y_i = f_i(\bar{y}_1, \bar{y}_2, \bar{y}_3, t) \qquad (i = 1, 2, 3), \tag{16}$$

where the $f$'s are a priori unknown functions, should be such as
to make

$$dl^2 = \sum_{i=1}^{3} dy_i^2 \tag{17}$$

independent of t at every instant, whatever may be the
differentials $d\bar{y}_i$.

We get an obvious case in which this condition is satisfied if
we suppose that the transformation formulae are linear in the
$\bar{y}$'s (though not necessarily with respect to t), and in
particular that they are of the form

$$y_i = \sum_{k=1}^{3} c_{ik}\,\bar{y}_k + \varphi_i(t), \tag{18}$$

where the $c$'s are completely arbitrary constants, subject only
to the qualitative condition that their determinant $\|c_{ik}\|$
does not vanish. It should be remembered that in the equations (15)
the coefficients were in addition direction cosines (in some
cases variable with the time) for two sets of orthogonal
axes.

A transformation of the type (18) between the $y$'s and the
$\bar{y}$'s is, at every instant (i.e. for any assigned value of t),
linear, and therefore homographic, or rather affine, so that
straight lines are transformed by it into straight lines. From our
point of view, this means that curves which appear to the observer
$\bar{\Sigma}$ to be straight lines are so also for the observer
$\Sigma$, and inversely.

It would not be hard to show that, if we impose on a
transformation (16) the double condition of making $dl^2$, or the
right-hand side of (17), independent of t, and also of keeping
geodesics unchanged, we necessarily reach, if not an affine
transformation (18), at least the product (in the sense of a product
of operators) of an affine transformation by a rigid motion in the
ordinary sense of the term. By a suitable choice of the trihedra of
reference, the passage between the $\bar{y}$'s and the $y$'s can thus
be effected by applying in succession the two following
transformations:

(1) An affine transformation given in its canonical form,
i.e. by means of equations of the type

$$y'_1 = k_1\bar{y}_1, \qquad y'_2 = k_2\bar{y}_2, \qquad y'_3 = k_3\bar{y}_3, \tag{19}$$

where the $k$'s are positive constants;

(2) a transformation (15) between the $y'$'s and the $y$'s.

We shall not spend time on the elementary considerations
which lead to this conclusion, and shall merely point out that the
coefficients $k$ of the equations (19) determine the deformation
consequent on the affine correspondence between the $\bar{y}$'s and
the $y'$'s, while in the second change, from the $y'$'s to the $y$'s,
there is no further deformation.

It follows from this, taking (19) into account, and considering
the two observers $\Sigma$ and $\bar{\Sigma}$, that a segment having
the same direction as the axis $\bar{O}\bar{y}_i$, and of length
$\bar{l}$ with respect to the observer $\bar{\Sigma}$ attached
to the body, will appear to the observer $\Sigma$ as having the
length $k_i\bar{l}$; hence the factor $k_i$ is called the coefficient
of elongation. The "elongation" of unit length is accordingly
$k_i - 1$; this represents an expansion or contraction according as
$k_i >$ or $< 1$. The formulae (19) of course provide, more generally,
information as to the alteration in length of segments (and
therefore of vectors) in any direction whatever. If the coefficients
$\bar{\alpha}_i$ (i = 1, 2, 3) are the direction cosines with respect
to the axes $\bar{O}\bar{y}_1\bar{y}_2\bar{y}_3$ of a generic segment
and $\bar{l}$ the length as it appears to the observer
$\bar{\Sigma}$, we obviously get, for the length $l$ as estimated by
$\Sigma$,

$$l = \bar{l}\,\sqrt{k_1^2\bar{\alpha}_1^2 + k_2^2\bar{\alpha}_2^2 + k_3^2\bar{\alpha}_3^2}.$$
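
The length formula just obtained lends itself to a quick numerical
illustration. What follows is only a sketch under our own
assumptions: the function name `apparent_length` and the sample
coefficients are hypothetical and do not occur in the text.

```python
import math

def apparent_length(l_bar, cosines, k):
    """Length seen by the fixed observer.

    l_bar   : length measured by the observer attached to the body
    cosines : direction cosines of the segment in the body axes
    k       : the three elongation coefficients (k1, k2, k3) of (19)
    """
    return l_bar * math.sqrt(sum((ki * ai) ** 2 for ki, ai in zip(k, cosines)))

# Hypothetical coefficients: contraction by one half along the first axis.
k = (0.5, 1.0, 1.0)

# A segment of length 2 lying along the first body axis.
along_axis = apparent_length(2.0, (1.0, 0.0, 0.0), k)

# A unit segment at 45 degrees in the plane of the first two axes.
s = math.sqrt(0.5)
diagonal = apparent_length(1.0, (s, s, 0.0), k)
```

For a segment along one axis the result is simply the coefficient
of that axis times the original length; for an oblique segment the
three coefficients are weighted by the squared direction cosines.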

Returning for a moment to the ordinary equations (15) of
rigid motion, we shall fix our attention in particular on the most
elementary case (which will serve as a guide and a basis of
comparison in the argument of the next section), that of uniform
translatory motion. We can then take the trihedron of reference
$Oy_1y_2y_3$ with one of its axes, say $Oy_1$, parallel to the
direction (by hypothesis constant) of the velocity, and we shall
take the trihedron $\bar{O}\bar{y}_1\bar{y}_2\bar{y}_3$ attached to
the body as coinciding with the fixed trihedron at the initial
instant t = 0.

The motion being translatory, the axes attached to the body
remain parallel to the corresponding fixed axes throughout; and
if $v$ is the velocity of translation, the formulae determining the
motion evidently reduce to

$$y_1 = \bar{y}_1 + vt, \qquad y_2 = \bar{y}_2, \qquad y_3 = \bar{y}_3. \tag{15'}$$

We have thus again reached the typical equations (14) for
each point ($\bar{y}_1$, $\bar{y}_2$, $\bar{y}_3$ constant) of the body.

8. Römerian units. Study of Lorentz transformations.

The equations (15'), which define in the simplest form an
ordinary uniform translation, can obviously be associated with
the identity

$$\bar{t} = t.$$

We thus get a quaternary transformation between
$(\bar{y}_1, \bar{y}_2, \bar{y}_3, \bar{t})$ and $(y_1, y_2, y_3, t)$,
which we shall denote by T.

We next observe that the most general representation of a
uniform translation, with arbitrary choice of the two trihedra
(one fixed, the other attached to the body), subject to the sole
condition that the origins coincide initially, can be reduced to T
together with two rotations independent of t. In fact, denoting
as before the two trihedra of reference (fixed and moving with
the body) by $\Sigma$ and $\bar{\Sigma}$, we shall denote by R a rigid
rotation of $\Sigma$ (round the origin O) which brings its axis
$Oy_1$ into the direction parallel to the velocity of translation.
Let $\bar{R}'$ be an analogous rotation (round $\bar{O}$) of the
trihedron $\bar{\Sigma}$; and $\bar{R}$ the inverse rotation.
Then the transformation formulae between
$(\bar{y}_1, \bar{y}_2, \bar{y}_3, \bar{t})$ and $(y_1, y_2, y_3, t)$
are represented by the symbolic product

$$R\,T\,\bar{R}.$$

Now consider the well-known kinematical deduction from the
classical method of representing rigid motion, namely, that if
we consider any velocity $c\mathbf{u}$ whatever with respect to
$\bar{\Sigma}$ (c being the modulus and $\mathbf{u}$ the versor), this
becomes $c\mathbf{u} + \mathbf{v}$ with respect to $\Sigma$, if
$\mathbf{v}$ is the vector representing the velocity of translation of
$\bar{\Sigma}$ with respect to $\Sigma$. This, as has been observed,
is in contradiction with the results of experiment, at least as
regards the velocity of light, for which c in cm. per second has the
particular value $3 \cdot 10^{10}$, which remains unaltered, even when
compounded with a uniform translation (Michelson-Morley
experiment). The wish to restore concord between theory and
experiment leads us to modify the equations (15'), and with them, if
necessary, the equation $\bar{t} = t$, in such a way that the relation
$d\bar{s}_0^2 = ds_0^2$ (not merely $d\bar{l}^2 = dl^2$) shall be
rigorously satisfied; i.e. so that there shall be an identity
between two quadratic forms involving not only the space
co-ordinates but also the time.

Special transformations. We propose to try to modify these
transformation formulae (as usual very slightly, at least for small
values of v) so as to make $ds_0^2$ invariant. For this purpose we
shall have to replace t sometimes by the variable

$$y_0 = ct$$

and sometimes by the imaginary variable

$$y_4 = ict \qquad (i = \sqrt{-1}).$$

With this change, putting also $\frac{v}{c} = \beta$, the equations
(15') and $\bar{t} = t$ become either

$$y_1 = \bar{y}_1 + \beta\bar{y}_0, \qquad y_2 = \bar{y}_2, \qquad y_3 = \bar{y}_3, \qquad y_0 = \bar{y}_0 \tag{20}$$

or

$$y_1 = \bar{y}_1 - i\beta\bar{y}_4, \qquad y_2 = \bar{y}_2, \qquad y_3 = \bar{y}_3, \qquad y_4 = \bar{y}_4. \tag{20'}$$

The real variable $y_0$ introduced here is only the time measured
by choosing as unit the time taken by light to traverse unit
space. Thus the velocity of light is 1 and the dimensions of time
become the same as those of length. The character of a primary
magnitude ordinarily assigned to the time thus disappears, the
unit of time being linked up with the unit of length by means of
the phenomenon of the propagation of light. It will be convenient
to apply the term “Römerian”¹ to measurements of time
made in this way; we shall similarly use the term “Römerian
velocities” (which are pure numbers) for velocities referred to
the Römerian time $y_0$. It obviously follows from the equation
$y_0 = ct$ that a Römerian velocity is only the corresponding
ordinary velocity divided by c; in particular, the quantity
$\beta = \frac{v}{c}$ just introduced is only the Römerian velocity
of translation.

¹ From O. Römer (1644-1710), who was the first to discover and determine
the velocity of light. His method was based on observation of the eclipses of
Jupiter’s satellites.

In accordance with Marcolongo’s remark quoted on p. 300,
the transformations we are in search of must leave invariant the
differential quadratic form $ds_0^2$, which we can write (introducing
the imaginary variable $y_4$) in the form

$$-ds_0^2 = dy_1^2 + dy_2^2 + dy_3^2 + dy_4^2.$$

In order to obtain particular transformations satisfying this
condition, we shall first consider linear homogeneous
transformations. These will at once result, as we have already
pointed out in the preceding section, in the condition (17) being
satisfied, which interprets the transformation as equivalent to a
rigid motion (if not in the ordinary sense, at least in the intrinsic
sense there specified). As we are dealing with linear (and
homogeneous) transformations, the invariance of the differential
form $-ds_0^2$ implies that of the algebraic quadratic form

$$-q = y_1^2 + y_2^2 + y_3^2 + y_4^2,$$

and reciprocally.

Starting from the equations (20') we shall examine whether
we can reach the required result if we keep the co-ordinates
$y_2$, $y_3$ invariant, i.e. if we suppose

$$y_2 = \bar{y}_2, \qquad y_3 = \bar{y}_3.$$

We have thus to find a linear transformation between the
variables $(y_1, y_4)$ and $(\bar{y}_1, \bar{y}_4)$ which will leave
invariant the expression

$$y_1^2 + y_4^2.$$

Hence (apart from the question of imaginaries) we have to
discuss a rigid rotation, round the origin of co-ordinates, in the
plane $y_1$, $y_4$, and therefore of the form

$$y_1 = \bar{y}_1\cos\varphi - \bar{y}_4\sin\varphi,$$
$$y_4 = \bar{y}_1\sin\varphi + \bar{y}_4\cos\varphi.$$

If we introduce the real variables $y_0$, $\bar{y}_0$ instead of
$y_4$, $\bar{y}_4$, it will be seen that the necessary and sufficient
condition for the disappearance of imaginaries from the ultimate
formulae is that the coefficient of $\bar{y}_1$ should be real and
that of $\bar{y}_4$ imaginary in the first equation, and vice versa
in the second. To obtain this result, $\varphi$ must be a pure
imaginary; in fact, putting

$$\varphi = i\psi \qquad (\text{with } \psi \text{ real}),$$

we get

$$\cos\varphi = \cos i\psi = \cosh\psi,$$
$$\sin\varphi = \sin i\psi = i\sinh\psi,$$

where $\cosh\psi$ and $\sinh\psi$ as usual denote the hyperbolic
cosine and sine.

Hence our transformation formulae take the form

$$\begin{aligned}
y_1 &= \bar{y}_1\cosh\psi + \bar{y}_0\sinh\psi, \\
y_2 &= \bar{y}_2, \\
y_3 &= \bar{y}_3, \\
y_0 &= \bar{y}_1\sinh\psi + \bar{y}_0\cosh\psi.
\end{aligned} \tag{21}$$
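
The passage from the rotation through a pure imaginary angle to the
real formulae (21) can be checked numerically. The sketch below is
illustrative only; the function names `boost` and
`imaginary_rotation` and the sample numbers are our own assumptions.

```python
import cmath
import math

def boost(y1_bar, y0_bar, psi):
    """Real form (21), restricted to the variables y1 and y0."""
    y1 = y1_bar * math.cosh(psi) + y0_bar * math.sinh(psi)
    y0 = y1_bar * math.sinh(psi) + y0_bar * math.cosh(psi)
    return y1, y0

def imaginary_rotation(y1_bar, y0_bar, psi):
    """Rotation through phi = i*psi in the plane (y1, y4), with y4 = i*y0."""
    phi = 1j * psi
    y4_bar = 1j * y0_bar
    y1 = y1_bar * cmath.cos(phi) - y4_bar * cmath.sin(phi)
    y4 = y1_bar * cmath.sin(phi) + y4_bar * cmath.cos(phi)
    return y1, y4 / 1j  # return y0 = y4 / i

# Hypothetical sample values.
y1, y0 = boost(0.3, 1.2, 0.7)
r1, r0 = imaginary_rotation(0.3, 1.2, 0.7)

# The quantity y0^2 - y1^2 before and after the transformation.
before = 1.2 ** 2 - 0.3 ** 2
after = y0 ** 2 - y1 ** 2
```

The two routes give the same numbers (the complex results carry a
vanishing imaginary part), and the difference of squares is left
unchanged, as the invariance of the quadratic form requires.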

If we remember that in the equations (20) the pure number
$\beta$ is in ordinary cases fairly small, we see that in these cases
the equations (21) differ quantitatively by very little from the
equations (20), provided we suppose $\psi$ sufficiently small for
$\cosh\psi$ and $\sinh\psi$ not to differ by very much from 1 and 0
respectively.

But we get a precise kinematical interpretation of the
parameter $\psi$ on which the transformation (21) depends if for
instance we fix our attention on the origin $\bar{O}$ of the moving
axes, i.e. on the point whose co-ordinates are
$\bar{y}_1 = \bar{y}_2 = \bar{y}_3 = 0$, with $\bar{y}_0$ a
generic value, this last parameter denoting the time
(Römerian time) as it appears to the observer $\bar{\Sigma}$. For the
fixed observer $\Sigma$, with respect to whom $y_0$ represents the time
(likewise Römerian) and $y_1$, $y_2$, $y_3$ position, we have,
corresponding to $\bar{O}$ and a generic value of $\bar{y}_0$,

$$y_1 = \bar{y}_0\sinh\psi, \qquad y_0 = \bar{y}_0\cosh\psi,$$

while $y_2$ and $y_3$ vanish. Hence the motion of $\bar{O}$ is
rectilinear, and the ratio

$$\frac{y_1}{y_0} = \tanh\psi,$$

which is obviously constant, is the (Römerian) velocity. Denoting
this ratio by $\beta$, we have in the equation

$$\tanh\psi = \beta$$

the required kinematical significance of the parameter $\psi$. More
generally, the same quantity $\beta$ stands for the Römerian velocity
of any other point P rigidly attached to $\bar{\Sigma}$. In fact, if
$\bar{y}_1$, $\bar{y}_2$, $\bar{y}_3$ are constants and $\bar{y}_0$ a
generic value, the result of differentiating the equations (21) is
to give

$$dy_1 = \sinh\psi\,d\bar{y}_0, \qquad dy_2 = dy_3 = 0, \qquad dy_0 = \cosh\psi\,d\bar{y}_0,$$

whence it follows that

$$\frac{dy_1}{dy_0} = \tanh\psi = \beta. \qquad \text{Q.E.D.}$$

Applying the ordinary formulae

$$\cosh\psi = \frac{1}{\sqrt{1 - \tanh^2\psi}}, \qquad \sinh\psi = \frac{\tanh\psi}{\sqrt{1 - \tanh^2\psi}},$$


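we can put the equations (21) in the form commonly used (the
special Lorentz transformation)

$$y_1 = \frac{\bar{y}_1 + \beta\bar{y}_0}{\sqrt{1 - \beta^2}}, \qquad y_2 = \bar{y}_2, \qquad y_3 = \bar{y}_3, \qquad y_0 = \frac{\bar{y}_0 + \beta\bar{y}_1}{\sqrt{1 - \beta^2}}, \tag{21'}$$

or, using the ordinary instead of the Römerian unit of time
(i.e. $\bar{t}$ and $t$ instead of $\bar{y}_0$ and $y_0$, with
$\bar{y}_0 = c\bar{t}$, $y_0 = ct$),

$$y_1 = \frac{\bar{y}_1 + v\bar{t}}{\sqrt{1 - \beta^2}}, \qquad y_2 = \bar{y}_2, \qquad y_3 = \bar{y}_3, \qquad t = \frac{\bar{t} + \dfrac{v}{c^2}\bar{y}_1}{\sqrt{1 - \beta^2}}. \tag{21''}$$

It will be seen that the necessary condition for these formulae
to be real is $\beta < 1$, or $v < c$; which once more demonstrates
that the velocity of light c is a limiting velocity.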

It can be easily verified that the formulae obtained from
(21') or the equivalent (21'') by solving these equations with
respect to $\bar{y}_1$, $\bar{y}_2$, $\bar{y}_3$, $\bar{y}_0$ (or
$\bar{t}$) differ from the first set only by
the change of $v$ into $-v$ (and therefore of $\beta$ into $-\beta$),
and of course of the two sets of variables; precisely as happens for
the equations (15') and (20) which refer to an ordinary translation.
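
The special Lorentz transformation in ordinary units, and the rule
that its inverse is obtained by changing $v$ into $-v$, can be tried
out on numbers. This is a minimal sketch under our own assumptions:
the names `lorentz` and `interval` are hypothetical, and units with
c = 1 are chosen purely for convenience.

```python
import math

def lorentz(y_bar, t_bar, v, c=1.0):
    """Special Lorentz transformation in ordinary units:
    from (y1_bar, y2_bar, y3_bar, t_bar) to (y1, y2, y3, t)."""
    if abs(v) >= c:
        raise ValueError("the formulae are real only for |v| < c")
    root = math.sqrt(1.0 - (v / c) ** 2)
    y1b, y2b, y3b = y_bar
    y = ((y1b + v * t_bar) / root, y2b, y3b)
    t = (t_bar + v * y1b / c ** 2) / root
    return y, t

def interval(y, t, c=1.0):
    """Finite analogue of the invariant form: (ct)^2 - y1^2 - y2^2 - y3^2."""
    return (c * t) ** 2 - sum(x * x for x in y)

# Hypothetical event and velocity of translation.
y_bar, t_bar, v = (1.0, 2.0, 3.0), 2.0, 0.6

y, t = lorentz(y_bar, t_bar, v)      # direct transformation
back_y, back_t = lorentz(y, t, -v)   # same formulae with v changed into -v
```

With these numbers the square root is 0.8, so that $y_1 = 2.75$ and
$t = 3.25$; applying the same function with $-v$ returns the starting
values, and the quadratic form is the same for both sets of variables.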


If in particular we suppose $\beta^2$, i.e. $\frac{v^2}{c^2}$ (but
not necessarily $\frac{v}{c}$), negligible in comparison with unity,
the first three of the equations (21'') reduce to the formulae (15')
of the ordinary translation, while the fourth gives rise to an
additive term denoting the difference of time between the two
observers $\Sigma$ and $\bar{\Sigma}$, expressed by the equation

$$t = \bar{t} + \frac{v}{c^2}\,\bar{y}_1.$$

It will be seen that the additional term $\frac{v}{c^2}\bar{y}_1$
depends on the position of the point at which $\Sigma$ has to apply
his own measurements of the time; for this reason $t$ is called the
local time. It was associated by Lorentz with the ordinary uniform
translations (15') with the intention of explaining to a first
approximation (i.e. neglecting $\beta^2$) the character of
electromagnetic phenomena for bodies in motion; this requires
explicitly that the relation $d\bar{s}_0^2 = ds_0^2$ should hold to
the same order of approximation.
Later on Lorentz himself discovered the equations (21''), which
result in the rigorous invariance of $ds_0^2$. Einstein rediscovered
them from the point of view of this invariance, which is the
mathematical expression of his principle of relativity in its most
elementary form.

Let us examine the formulae (21''). They contain the
best-known results (to some of which eminent students of relativity
have assigned paradoxical consequences) of the kinematics of
relativity. In the first place, the non-invariance of t, as noted
above in § 3 in general for any $V_4$, points to the necessity of
abandoning the ordinary concept of simultaneity in the absolute
sense. In fact, two instantaneous events, taking place at two
different points of space, may correspond to the same value of
$t$ but not of $\bar{t}$ (a sufficient condition is that the $y_1$
should be different), and may therefore be simultaneous for one
observer who uses $\Sigma$ as his system of reference and not for
another who uses $\bar{\Sigma}$. Hence the time ceases to be an
absolute quantity and becomes relative to the system of reference
and connected up with the space co-ordinates; it is in fact local
time, to use the term already referred to as having been introduced
by Lorentz in his researches on the electrodynamics of bodies in
motion.

Suppose that two events take place at the same point P of
the body (and therefore with the same $\bar{y}_1$, $\bar{y}_2$,
$\bar{y}_3$), but not at the same instant, being separated by an
interval of time $\Delta\bar{t}$ (measured in the system
$\bar{\Sigma}$): for the observer $\Sigma$ the interval will be
$\Delta t$, and the relation between the two is given at once by the
fourth of the equations (21''), noting that $\bar{y}_1$ is constant,
so that

$$\Delta t = \frac{\Delta\bar{t}}{\sqrt{1 - \beta^2}}.$$

Hence for the observer who accompanies the point P where
the phenomena take place, the interval of time is shorter
than for the fixed observer $\Sigma$; i.e. we have a slowing down of
the time with respect to $\Sigma$’s measure, as if the unit of
measurement had become $\frac{1}{\sqrt{1 - \beta^2}}$ times that used
by $\Sigma$.

Similarly two events which happen at what is for $\Sigma$ the same
point (i.e. with the same $y_1$, $y_2$, $y_3$) but separated by an
interval of time $\Delta t$, will appear to $\bar{\Sigma}$ to be
separated by a longer interval.
This follows at once from the fact pointed out above that the
inverse formulae of (21') and (21'') are found by changing $v$ into
$-v$ and therefore $\beta$ into $-\beta$.

We shall now try to determine the difference, if any, in the
estimates of lengths made by two observers $\Sigma$ and
$\bar{\Sigma}$, each at a specified instant of his own time. Suppose,
for instance, we wish to carry over to the observer $\Sigma$
measurements made by $\bar{\Sigma}$. Substituting in the first three
transformation formulae of (21'') the value of $\bar{t}$ in terms of
$t$ given by the fourth, we get

$$y_1 = \bar{y}_1\sqrt{1 - \beta^2} + vt, \qquad y_2 = \bar{y}_2, \qquad y_3 = \bar{y}_3,$$

which may be considered as resulting from the product of the
affine transformation

$$y'_1 = \bar{y}_1\sqrt{1 - \beta^2}, \qquad y'_2 = \bar{y}_2, \qquad y'_3 = \bar{y}_3$$

by the ordinary translation

$$y_1 = y'_1 + vt, \qquad y_2 = y'_2, \qquad y_3 = y'_3.$$

In this form of the equations the change in length is at once
obvious. We need only refer to the conclusions of the preceding
section, noting that the elongation coefficients $k_1$, $k_2$, $k_3$
of the formulae (19) are in this case represented by
$\sqrt{1 - \beta^2}$, 1, 1.

Hence if the fixed observer estimates distances at a generic
instant t, and if his results are compared with those of the observer
attached to the moving body, who is also estimating the same
distances at any instant whatever of his own time, then the former
observes a contraction, in the ratio $\sqrt{1 - \beta^2} : 1$, for
longitudinal segments, i.e. in the direction of motion, while there
is no change for transverse segments, i.e. perpendicular to the
velocity of translation.
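
The two kinematical effects just described, the lengthening of
intervals of time and the contraction of longitudinal segments, can
be put into numbers. The sketch is ours and purely illustrative:
the function names are hypothetical, and the oblique case simply
applies the elongation formula of the preceding section with
coefficients $\sqrt{1 - \beta^2}$, 1, 1.

```python
import math

def fixed_observer_interval(dt_bar, beta):
    """Interval of time for the fixed observer, given the interval dt_bar
    between two events at the same point of the moving body."""
    return dt_bar / math.sqrt(1.0 - beta ** 2)

def fixed_observer_length(l_bar, beta, cos_bar):
    """Length for the fixed observer of a segment of length l_bar attached
    to the body; cos_bar is the cosine (in the body axes) of its angle
    with the direction of motion."""
    k1 = math.sqrt(1.0 - beta ** 2)
    return l_bar * math.sqrt((k1 * cos_bar) ** 2 + (1.0 - cos_bar ** 2))

beta = 0.6  # hypothetical Roemerian velocity of translation

longer_interval = fixed_observer_interval(1.0, beta)
longitudinal = fixed_observer_length(1.0, beta, 1.0)
transverse = fixed_observer_length(1.0, beta, 0.0)
```

For $\beta = 0.6$ the square root is 0.8: a unit interval becomes
1.25, a longitudinal unit segment becomes 0.8, and a transverse one
is unaltered.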

The inverse formulae, for the change from $\Sigma$ to
$\bar{\Sigma}$, differ, as we have already said, only by the change
of $v$ into $-v$. Hence the same rules hold good; e.g. fixed segments
in the direction of motion will appear to the moving observer as
contracted in the ratio $\sqrt{1 - \beta^2} : 1$ in comparison with
the measurement of them made by the observer $\Sigma$; and so on.

General transformations. We propose lastly to prove a result
analogous to one shown above to hold for the translations of
classical kinematics, namely, that the most general Lorentz
transformation ($\Lambda$) (i.e. a linear transformation between two
sets of four variables $y_i$ and $\bar{y}_i$ (i = 1, 2, 3, 4) for
which the quadric $q$ remains invariant) can be represented in the
symbolic form

$$R\,\mathcal{L}\,\bar{R},$$

where $R$ and $\bar{R}$ are ordinary orthogonal transformations
(rotations) acting on $(y_1, y_2, y_3)$ and on
$(\bar{y}_1, \bar{y}_2, \bar{y}_3)$ respectively, and $\mathcal{L}$ is
a special Lorentz transformation of the type studied above.

The transformation ($\Lambda$) will be a quaternary orthogonal
transformation of the type

$$y_h = \sum_{k=1}^{4} a_{hk}\,\bar{y}_k, \tag{22}$$

whose coefficients constitute an orthogonal matrix, i.e. such
that

$$\sum_{k=1}^{4} a_{hk}\,a_{jk} = \delta_{hj}, \tag{23}$$

$$\sum_{k=1}^{4} a_{kh}\,a_{kj} = \delta_{hj} \tag{23'}$$

(h, j = 1, 2, 3, 4).

In order that the variables $y_1$, $y_2$, $y_3$, $\bar{y}_1$,
$\bar{y}_2$, $\bar{y}_3$ may be real, and $y_4$, $\bar{y}_4$ pure
imaginaries, we must evidently have $a_{hk}$ (h, k < 4) real,
$a_{h4}$ and $a_{4h}$ (h < 4) pure imaginaries, and $a_{44}$ real.

We shall of course interpret $y_1$, $y_2$, $y_3$ as Cartesian
co-ordinates with respect to a trihedron $K$ rigidly attached to
$\Sigma$, and $y_4$ as the time variable; and similarly for the
$\bar{y}$'s.

The directions of the trihedra $K$ and $\bar{K}$ are a priori
arbitrary; we shall now determine a rotation $R$ for $K$ and a
rotation $\bar{R}$ for $\bar{K}$ such that we shall have

$$\Lambda = R\,\mathcal{L}\,\bar{R}.$$

To do this, we consider, with reference to $K$, the vector whose
components are $a_{14}$, $a_{24}$, $a_{34}$; let $\mathbf{i}$ denote
its versor. If we turn the trihedron $K$ round in such a way that its
axis $y_1$ takes the direction of $\mathbf{i}$, we shall have

$$a_{24} = a_{34} = 0;$$

we shall take this as the rotation $R$.

Now from the identities (23) and the values just given it
follows that

$$\sum_{k=1}^{3} a_{2k}^2 = 1, \qquad \sum_{k=1}^{3} a_{3k}^2 = 1, \qquad \sum_{k=1}^{3} a_{2k}\,a_{3k} = 0,$$

so that the two vectors determined (with respect to $\bar{K}$) by the
components $a_{2k}$, $a_{3k}$ (k = 1, 2, 3) are of unit length and
orthogonal. We shall call them $\mathbf{j}$, $\mathbf{k}$, and shall
take as the rotation $\bar{R}$ the rotation which turns the trihedron
$\bar{K}$ so that its axes $\bar{y}_2$, $\bar{y}_3$ coincide in
direction with the vectors $\mathbf{j}$, $\mathbf{k}$, so that we get

$$a_{21} = a_{23} = a_{31} = a_{32} = 0.$$

As a result of the two rotations $R$ and $\bar{R}$ the form of the
matrix of the $a$'s comes to be

$$\begin{Vmatrix}
a_{11} & a_{12} & a_{13} & a_{14} \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
a_{41} & a_{42} & a_{43} & a_{44}
\end{Vmatrix}$$


and, from the group properties of orthogonal substitutions, this
matrix must also correspond to a substitution of this kind (since
it is the result of the product of the original substitution by two
rotations). A consequence of this is the vanishing of four other
elements of the matrix: in fact, the conditions that the first line
of the matrix shall be orthogonal with the second and third lines
respectively are

$$a_{12} = a_{13} = 0,$$

and similarly, taking the fourth line with the second and third,
we get

$$a_{42} = a_{43} = 0.$$

Thus we finally get the matrix in the form

$$\begin{Vmatrix}
a_{11} & 0 & 0 & a_{14} \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
a_{41} & 0 & 0 & a_{44}
\end{Vmatrix}$$

which corresponds to a transformation of the type (21), i.e. to
a special Lorentz transformation $\mathcal{L}$. We have thus shown
that, through the two rotations $R$ (for the trihedron $K$) and
$\bar{R}$ (for $\bar{K}$), the general transformation ($\Lambda$)
reduces to a special transformation $\mathcal{L}$.

So far we have considered only linear transformations. The
question may be raised whether we should not get greater
generality if this restriction were removed. In this connexion
we shall merely say¹ that the linear transformations studied
here are the only ones which, in addition to retaining the
invariance of $ds_0^2$, make finite values of the $y$'s correspond to
finite values of the $\bar{y}$'s, and vice versa.

9. Relative motion. Composition of velocities. Kinematical 
justification of a formula of Fresnel’s. 

In order to show the relation between the various aspects of
a single motion (let us say specifically the motion of an assigned
point P) with reference to two different observers $\Sigma$ and
$\bar{\Sigma}$, it is only necessary to use the transformation
formulae between the corresponding co-ordinates. This holds both in
ordinary kinematics and in relativity kinematics, with the reminder
that for the latter the time is among the co-ordinates affected by
the transformation.

Consider in particular a Lorentz translation, which, as we
have seen in the preceding section, is defined by the formulae
(21''), suitable choice being made in advance of the two trihedra
which represent the observers and are denoted by $\Sigma$ and
$\bar{\Sigma}$.

Now suppose that the motion (which we can call relative)
of the point P in relation to $\bar{\Sigma}$ is given; i.e. that the
expressions $\bar{y}_i(\bar{y}_0)$ (i = 1, 2, 3) for its three space
co-ordinates are known formally as functions of $\bar{y}_0$ (the
Römerian time). To obtain a representation of the absolute motion,
i.e. the motion with reference to $\Sigma$, it is obviously
sufficient to find the expressions for the co-ordinates $y_i$
(i = 1, 2, 3) of the point P as functions of the new time variable
$y_0$. The transformation formulae (21') give the required result at
once; in fact, if we insert in them for the $\bar{y}_i$ the
expressions $\bar{y}_i(\bar{y}_0)$ belonging to the moving point P,
all the $y$'s become known functions of $\bar{y}_0$; and if we suppose
this parameter found from the fourth equation

$$y_0 = \frac{\bar{y}_0 + \beta\bar{y}_1}{\sqrt{1 - \beta^2}}$$

and substituted in the first three, we get the equations of absolute
motion in their explicit form.

¹ For the proof, cf. C. Munari: “Sopra una espressiva interpretazione
cinematica del Principio di Relatività”, in Rend. della R. Acc. dei Lincei,
Vol. XXIII (1914), p. 781.

The resulting relation between the absolute and relative
velocities is especially interesting. The vector rule no longer
holds that the absolute velocity = the relative velocity + the
velocity of the moving origin (the latter, in the case of
translations, of course reduces to the velocity of translation,
whatever may be the instantaneous position of P). The relativity
composition of velocities is a little more complicated. In order to
see what happens in the clearest case, we shall consider a relative
motion parallel to the translation $v$. With this hypothesis the
co-ordinates $\bar{y}_2$ and $\bar{y}_3$ of the point P are constant,
and, from (21'), $y_2$, $y_3$ are also constant, or, in other words,
the motion with respect to $\Sigma$ is also in the direction of the
translation. Differentiating the first and fourth equations of (21'),
we get

$$dy_1 = \frac{d\bar{y}_1 + \beta\,d\bar{y}_0}{\sqrt{1 - \beta^2}},$$

$$dy_0 = \frac{d\bar{y}_0 + \beta\,d\bar{y}_1}{\sqrt{1 - \beta^2}}.$$

Putting for the sake of shortness

$$\beta_a = \frac{dy_1}{dy_0}, \qquad \beta_r = \frac{d\bar{y}_1}{d\bar{y}_0},$$

so that $\beta_a$ and $\beta_r$ are the velocities (scalar and
Römerian) of the point P with respect to $\Sigma$ and $\bar{\Sigma}$
respectively (absolute velocity and relative velocity), the foregoing
formulae, on dividing the first by the second, give

$$\beta_a = \frac{\beta_r + \beta}{1 + \beta\beta_r}; \tag{24}$$



this is what is called Einstein’s law for the composition of
velocities. Multiplying by c and remembering that $y_0 = ct$,
$\bar{y}_0 = c\bar{t}$, we can evidently replace the Römerian
velocities $\beta_a = \frac{dy_1}{dy_0}$,
$\beta_r = \frac{d\bar{y}_1}{d\bar{y}_0}$, $\beta$ by the
corresponding ordinary velocities $v_a = \frac{dy_1}{dt}$,
$v_r = \frac{d\bar{y}_1}{d\bar{t}}$, $v$, and write

$$v_a = \frac{v_r + v}{1 + \dfrac{v_r v}{c^2}}. \tag{24'}$$

If both the velocity $v_r$ of P (with respect to $\bar{\Sigma}$) and
the velocity of translation $v$ are small in comparison with c, the
denominator differs from unity by a term of the second order; if we
neglect this difference we get back to the fundamental relation of
ordinary kinematics (which may be called Galilean)

$$v_a = v_r + v;$$

in view of the criterion we are applying, this result was of course
to be expected. In general the equation (24) shows that, for
$|\beta|$ and $|\beta_r|$ less than unity, $|\beta_a|$ is also < 1;
while, for $|\beta|$ or $|\beta_r|$ equal to unity, $|\beta_a|$ is
also = 1. To prove this, note that, whenever $|\beta| < 1$ and
$|\beta_r| < 1$,

$$(1 + \beta\beta_r)^2 - (\beta + \beta_r)^2 = (1 - \beta^2)(1 - \beta_r^2)$$

is always positive, so that
$\beta_a^2 = \left(\frac{\beta + \beta_r}{1 + \beta\beta_r}\right)^2 < 1$;
while for $|\beta| = 1$, or $|\beta_r| = 1$, $\beta_a^2 = 1$; which
proves the required result. We thus find once more the limiting
character of the velocity c of light: however near $v_r$ may be to c,
provided it is less than c ($\beta_r < 1$), if it is compounded with
another velocity of translation $v$, less than c, but as nearly equal
to c as we please ($|\beta| < 1$), the result will always be less
than c, or in other words $|\beta_a|$ always < 1. Vice versa, the
velocity c for $\bar{\Sigma}$ remains c for any $\Sigma$, whatever may
be the velocity of the (Lorentz) translation with which the two
observers are moving with respect to one another.
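
The limiting character of the velocity of light in the law of
composition can be seen on a few numbers. The following is only a
sketch: the names `compose` and `identity_gap` and the sample
velocities are our own choices.

```python
def compose(beta_r, beta):
    """Einstein's law of composition for collinear Roemerian velocities."""
    return (beta_r + beta) / (1.0 + beta * beta_r)

def identity_gap(beta_r, beta):
    """Difference between the two sides of the identity used in the proof:
    (1 + b*br)^2 - (b + br)^2  and  (1 - b^2)(1 - br^2)."""
    lhs = (1.0 + beta * beta_r) ** 2 - (beta + beta_r) ** 2
    rhs = (1.0 - beta ** 2) * (1.0 - beta_r ** 2)
    return lhs - rhs

half_and_half = compose(0.5, 0.5)   # two velocities each equal to c/2
near_light = compose(0.99, 0.99)    # two velocities very near to c
light = compose(1.0, 0.3)           # light compounded with a translation
gap = identity_gap(0.4, -0.7)
```

Two velocities of one half compound to 0.8 and not to 1; two
velocities of 0.99 still give a result below unity; and the velocity
1 of light compounded with any translation remains 1.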

Within the scale of velocities of ponderable bodies (velocities
small compared with c), the relation (24') reduces sensibly to the
Galilean formula $v_a = v_r + v$, as we have already said. But
when the phenomenon of motion under consideration is the
propagation of light in a transparent medium, so that the velocity
has an order of magnitude comparable with that of c, then the
divergence between the Einsteinian and the Galilean kinematics
becomes striking, and lends itself to experimental verification.

Einstein has in fact drawn from this a magnificent argument
in support of the theory of relativity. He deduced logically (by
a purely kinematical proof from (24')) a formula of Fresnel's¹
concerning the movement of light waves through transparent
media in translatory motion; a formula which was experimentally
confirmed for the first time by Fizeau (1851), whose experiments
were repeated with improved methods by Michelson and Morley
and by Zeeman.

¹ Even before Einstein, Lorentz had given a theoretical justification of
Fresnel's formula, based on his celebrated electron theory of the
electromagnetic phenomena of bodies in motion. Einstein's explanation is
plainly more attractive.

The argument is briefly as follows. In a medium of refractive
index $\mu$, it is known that light is propagated with velocity
$\frac{c}{\mu}$, if the medium is at rest. Suppose instead that the
medium has a velocity $v$ in the direction of propagation of light
(in the same or the opposite sense). Ordinary kinematics would lead
us to expect that the velocity of propagation (with respect to the
observer) would become $\frac{c}{\mu} \pm v$; Fizeau and the others,
however, by delicate experiments on interference phenomena, found
that the amount to be added to $\frac{c}{\mu}$ (or subtracted from
it) is not the whole of $v$, but $v$ multiplied by the coefficient
(< 1) $1 - \frac{1}{\mu^2}$, so that the velocity of propagation is

$$\frac{c}{\mu} \pm \left(1 - \frac{1}{\mu^2}\right)v. \tag{25}$$

The factor $1 - \frac{1}{\mu^2}$ is known as Fresnel's convection
coefficient.
The expressions (25) are evidently not in agreement with the
Galilean kinematics. But they are in excellent agreement with
the Einsteinian kinematics. In fact, let us consider, to fix
ideas, the case in which the motion of the medium is in the same
sense as the propagation of light, so that we take the + sign in
(25). Then (24') holds when we put $\frac{c}{\mu}$ for $v_r$, and
therefore $\frac{1}{\mu}$ for $\beta_r$. Hence it gives

$$v_a = \frac{\dfrac{c}{\mu} + v}{1 + \dfrac{\beta}{\mu}},$$

or, neglecting terms of the second order in
$\beta \left(= \frac{v}{c}\right)$,

$$v_a = \left(\frac{c}{\mu} + v\right)\left(1 - \frac{\beta}{\mu}\right) = \frac{c}{\mu} + \left(1 - \frac{1}{\mu^2}\right)v - \frac{\beta}{\mu}\,v.$$

The last term is also of the second order with respect to the
first, so that we are left finally with Fresnel’s formula

$$v_a = \frac{c}{\mu} + \left(1 - \frac{1}{\mu^2}\right)v.$$
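
The closeness of Fresnel's formula to the exact law of composition
can be illustrated numerically. The sketch below rests on our own
assumptions: the function names are hypothetical, and the refractive
index 1.33 and the velocity of 7 metres per second are sample values
chosen only for illustration, with c expressed in metres per second.

```python
C = 299_792_458.0  # velocity of light in metres per second

def exact_velocity(mu, v, c=C):
    """Velocity of light in the moving medium from the exact law of
    composition, taking the relative velocity equal to c/mu."""
    v_r = c / mu
    return (v_r + v) / (1.0 + v_r * v / c ** 2)

def fresnel_coefficient(mu):
    """Fresnel's coefficient 1 - 1/mu^2."""
    return 1.0 - 1.0 / mu ** 2

mu, v = 1.33, 7.0  # hypothetical medium and velocity of the medium

# Fraction of v actually added to c/mu according to the exact law.
dragged_fraction = (exact_velocity(mu, v) - C / mu) / v
```

For such velocities the fraction obtained from the exact law agrees
with Fresnel's coefficient to within the terms of the second order
neglected in the text, and differs markedly from the value 1 which
the Galilean rule would give.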


10. Further generalization of the metric of $V_4$ still coinciding
to a first approximation with ordinary dynamics.

We now propose to see whether it is possible to assign to the
$V_4$ other metrics slightly different from that characterized by
(5), but such that the dynamical principle underlying them is
still equivalent, to a first approximation, to Hamilton’s principle.

We return to the general form (5') of $ds^2$ and observe first
that the particular form (6) just considered is a case of (6')
obtained by identifying the time co-ordinate $x_0$ with $ct$ and the
space co-ordinates $x_1$, $x_2$, $x_3$ with the Cartesian co-ordinates
$y_1$, $y_2$, $y_3$, and putting

$$g_{00} = 1 - \frac{2U}{c^2}, \qquad g_{0i} = 0, \qquad g_{ik} = -\delta_{ik} \qquad (i, k = 1, 2, 3). \tag{26}$$

If now we wish to consider a metric whose coefficients $g$
differ by very little from the values (26), we can put

ifoo = 1 — 2^. 9oi = — y.. 9ik (26') 

where ^ ^ 


with the understanding that the quantities $\gamma$ (which as regards
dimensions are pure numbers) are of the second order (like
$\frac{U}{c^2}$) or of higher order with respect to $\frac{v}{c}$,
while $\psi$ (which also has the dimensions of a number) is to be
considered as of at least the third order.

With these values for the coefficients, and taking the variables
$y_1$, $y_2$, $y_3$ as co-ordinates differing very little from
Cartesians, we can write $ds^2$ in the form


■=(l-2^)%o® 


- 'Uy^ S, yi dyi — ILy, (8J + yi^)dyi dy ^ . (27) 

1 1 





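If we denote derivation with respect to $y_0$ by a dash, and put

$$ \Gamma_1 = \sum_{i=1}^{3} \gamma_i\,y_i', \qquad \Gamma_2 = \tfrac12 \sum_{i,k=1}^{3} \gamma_{ik}\,y_i'\,y_k', $$

we shall have

$$ \frac{ds^2}{dy_0^2} = 1 - \beta^2 - 2\gamma - 2\Gamma_1 - 2\Gamma_2, \qquad (28) $$

where $\beta^2 = \sum_{i=1}^{3} y_i'^2 = v^2/c^2$. It is to be noted that since
$y_i' = \frac{1}{c}\frac{dy_i}{dt}$ is of the first order, it follows that $\Gamma_1$ is of
the third and $\Gamma_2$ of the fourth order at least.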

To shorten the work, we shall introduce the quadrinomial

$$ \Gamma = \tfrac12\beta^2 + \gamma + \Gamma_1 + \Gamma_2, $$

observing that it is composed of terms of the second order,
which can be written $\frac{1}{c^2}\left(\tfrac12 v^2 + U\right)$, plus terms of higher order.

We then have

$$ \frac{ds^2}{dy_0^2} = 1 - 2\Gamma, $$

and we can extract the square root, neglecting powers of $\Gamma$
higher than the second (i.e. terms of order higher than the fourth).
To this degree of approximation we get

$$ \frac{ds}{dy_0} = 1 - \Gamma - \tfrac12\Gamma^2, $$

i.e. rewriting $c\,dt$ for $dy_0$, and multiplying by $c^2\,dt$,

$$ c\,ds = c^2\,dt - c^2\,dt\left(\Gamma + \tfrac12\Gamma^2\right). $$
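The truncation of the square root used here can be tested numerically. The sketch below is an illustration only (the function names are hypothetical): it compares $\sqrt{1 - 2\Gamma}$ with $1 - \Gamma - \tfrac12\Gamma^2$ for a few small values of $\Gamma$ and shows that the neglected remainder behaves like $\tfrac12\Gamma^3$, i.e. is of the sixth order when $\Gamma$ is of the second.

```python
import math


def ds_ratio_exact(gamma: float) -> float:
    """Exact value of ds/dy0 = sqrt(1 - 2*Gamma)."""
    return math.sqrt(1.0 - 2.0 * gamma)


def ds_ratio_series(gamma: float) -> float:
    """Expansion 1 - Gamma - Gamma^2/2, neglecting higher powers of Gamma."""
    return 1.0 - gamma - 0.5 * gamma**2


for gamma in (1e-2, 1e-3):
    remainder = ds_ratio_series(gamma) - ds_ratio_exact(gamma)
    # The ratio remainder / Gamma^3 should be close to 1/2.
    print(gamma, remainder, remainder / gamma**3)
```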

Substituting this expression in the variational equation of
dynamics

$$ \delta \int c\,ds = 0, $$

and remembering that $\delta t$ vanishes at the limits of integration, we
see that this equation reduces to

$$ \delta \int c^2\left(\Gamma + \tfrac12\Gamma^2\right) dt = 0, $$

and that the corresponding Lagrangian function is therefore

$$ L = c^2\left(\Gamma + \tfrac12\Gamma^2\right), $$

or expanding, and neglecting terms of order higher than the
fourth (in the sense just defined, i.e. ignoring the presence of the
factor $c^2$),

$$ L = \tfrac12 v^2 + U + c^2\left(\psi + \Gamma_1 + \Gamma_2\right) + \frac{1}{2c^2}\left(\tfrac12 v^2 + U\right)^2. \qquad (29) $$

The first two terms of the expression on the right (reduced
to zero dimensions, i.e. divided by $c^2$) are of the second order;
they constitute the Lagrangian function of the classical mechanics,
from which we began our investigations.

The successive terms of (29) (reduced to zero dimensions in
the same way) are of higher order: hence they will represent
small corrections to be applied to the equations of motion. The
metric (27) which we have here assumed still gives, therefore,
to a first approximation, the same laws as are deduced from
Hamilton’s classical principle. Besides the potential $U$, it
contains the ten functions $\psi$, $\gamma_i$, $\gamma_{ik}$ of the four variables $y$
(position and time); these are small, as we agreed, and as we have
repeatedly had to remember in making the various transformations,
but are a priori arbitrary. We shall see farther on how the law of
universal gravitation and a criterion provided by the tensor
calculus lead to the determination of these ten functions (from
ten differential equations), and so to an explanation of some
slight divergences which have been observed between the results
predicted by the Newtonian mechanics and the true motion of
the heavenly bodies. This more exact correspondence between
theory and observation provides a physical justification of
Einstein’s new method of approach, which further incontestably
represents an enormous speculative advance through its
characteristic of securing invariance for all transformations of the
co-ordinates, not only of space, but also of time.

11. An important particular case. Corresponding trajectories
and their identity with those of an ordinary mechanical problem.

We shall now apply the expression (29) for $L$ to a special case,
the interest of which will be seen in the next chapter (§ 8, p. 394).
Suppose that we have

$$ \Gamma_1 = 0, \qquad \Gamma_2 = \tfrac12\beta^2\chi $$

(either exactly, or neglecting terms of order higher than the third
and fourth respectively), where $\chi$ is a function of the $y$’s, of at
least the second order. Suppose further that $\psi$ and $\chi$, like $U$, do
not explicitly depend on the time. We shall meet later on a
characteristic example in which this condition is satisfied.

The expression (29) can now be written

$$ L = \tfrac12 v^2\left(1 + \frac{2U}{c^2} + \chi\right) + U + c^2\psi + \frac{c^2}{2}\left(\frac{\tfrac12 v^2 - U}{c^2}\right)^2. \qquad (30) $$

It is to be noted that $\left(\frac{\frac12 v^2 - U}{c^2}\right)^2$ in the last term is of the
fourth order, while the principal part of $L$ is of the second order;
hence, in this last term, we may calculate $\frac{\frac12 v^2 - U}{c^2}$ only to a first
approximation. But we know that to a first approximation the
classical mechanics holds, and that therefore the integral of vis
viva exists in the form

$$ \tfrac12 v^2 - U = E_0 = \text{constant}; $$

hence the last term on the right of (30) can be replaced by the
constant $\frac{E_0^2}{2c^2}$, or even suppressed, since a constant contributes
nothing to the variational equation.
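The passage from (29) to (30) rests on one algebraic identity, which may be worth writing out. The following worked step is a sketch under the hypotheses of this section as read here, namely $\Gamma_1 = 0$ and $c^2\Gamma_2 = \tfrac12 v^2\chi$; it shows where the term $2U/c^2$ in the coefficient of $\tfrac12 v^2$ comes from.

```latex
\begin{align*}
\left(\tfrac12 v^2 + U\right)^2
  &= \left(\tfrac12 v^2 - U\right)^2 + 2 v^2 U, \\[4pt]
\frac{1}{2c^2}\left(\tfrac12 v^2 + U\right)^2
  &= \frac{c^2}{2}\left(\frac{\tfrac12 v^2 - U}{c^2}\right)^2
     + \tfrac12 v^2 \cdot \frac{2U}{c^2}, \\[4pt]
L &= \tfrac12 v^2 + U + c^2\psi + \tfrac12 v^2 \chi
     + \frac{1}{2c^2}\left(\tfrac12 v^2 + U\right)^2 \\
  &= \tfrac12 v^2\left(1 + \frac{2U}{c^2} + \chi\right) + U + c^2\psi
     + \frac{c^2}{2}\left(\frac{\tfrac12 v^2 - U}{c^2}\right)^2 .
\end{align*}
```

The advantage of this regrouping is that the last term now contains only the combination $\tfrac12 v^2 - U$, which is constant to a first approximation.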

The remaining terms of $L$ can be separated into two groups,
according as they do or do not depend on the velocity, by putting

$$ T = \tfrac12 v^2\left(1 + \frac{2U}{c^2} + \chi\right), \qquad U' = U + c^2\psi, $$

and therefore

$$ L = T + U'. $$

This form of the Lagrangian function corresponds exactly
to the form found in the classical mechanics (for a system with
three degrees of freedom, if not for a material particle), if we
consider $T$ as corresponding to the vis viva and $U'$ to the potential.
Further, it is known¹ that, whenever (as in this case) $T$ is a
quadratic form in the quantities $dy_i/dt$ not explicitly containing
$t$, and $U'$ is a function only of the Lagrangian co-ordinates
$y$, then the differential equations arising from

$$ \delta \int (T + U')\,dt = 0 $$

admit of the integral (of vis viva)

$$ T - U' = E, $$

where $E$ is a constant (the total energy), and the trajectories
corresponding to a given value of $E$ are identical with the geodesics
of a manifold such that the square of its line element is defined by

$$ ds^2 = 2\,(U' + E)\,T\,dt^2. $$

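(Principle of Stationary Action.) Applying all this to our case
we shall have the integral of vis viva in the form

$$ \tfrac12 v^2\left(1 + \frac{2U}{c^2} + \chi\right) - \left(U + c^2\psi\right) = E, $$

and, for any value of $E$ fixed in advance, we can assert that the
trajectories coincide with the geodesics of the manifold

$$ ds^2 = \left(U + c^2\psi + E\right)\left(1 + \frac{2U}{c^2} + \chi\right) dl_0^2, $$

where $dl_0^2 = \sum_{i=1}^{3} dy_i^2$,
or with the trajectories of the motion, in ordinary space, of a
material particle with total energy zero, and acted on by forces
derived from the potential

$$ U^* = \left(U + c^2\psi + E\right)\left(1 + \frac{2U}{c^2} + \chi\right), $$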

which can also be written (neglecting constant terms and terms
of higher order) as

$$ U^* = U + c^2\psi + \frac{2U^2}{c^2} + U\chi + E\left(\frac{2U}{c^2} + \chi\right). \qquad (31) $$

¹ Cf. for example Levi-Civita and Amaldi: Lezioni di meccanica razionale,
Vol. II, Chapter XI, No. 16 (Bologna, Zanichelli; in the press); or Whittaker:
Analytical Dynamics, 2nd edition, Chapter IX (Cambridge University Press, 1917).
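The reduction of the potential to the form (31) can be illustrated numerically. The sketch below assumes, as a reading of the garbled formulae, that the full potential is the product $(U + c^2\psi + E)(1 + 2U/c^2 + \chi)$ and that (31) is obtained from it by dropping the constant $E$ and the higher-order term $c^2\psi\,(2U/c^2 + \chi)$. The function names and the sample magnitudes (units with $c = 1$) are hypothetical.

```python
def potential_product(U, psi, chi, E, c):
    """Full product (U + c^2 psi + E)(1 + 2U/c^2 + chi)."""
    return (U + c**2 * psi + E) * (1.0 + 2.0 * U / c**2 + chi)


def potential_31(U, psi, chi, E, c):
    """Reduced form: U + c^2 psi + 2U^2/c^2 + U chi + E (2U/c^2 + chi)."""
    return (U + c**2 * psi + 2.0 * U**2 / c**2 + U * chi
            + E * (2.0 * U / c**2 + chi))


c = 1.0
U, psi, chi, E = 1e-4, 1e-6, 1e-4, -5e-5   # assumed small quantities

difference = potential_product(U, psi, chi, E, c) - potential_31(U, psi, chi, E, c)
dropped = E + c**2 * psi * (2.0 * U / c**2 + chi)

# The two forms differ exactly by the constant E and one higher-order term.
print(difference, dropped)
```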

12. Qualitative characteristics of relativity metrics. Geodesic 
principle for the dynamics of a material particle. Stationary and, 
in particular, statical line elements. 

In accordance with the remarks at the end of § 10, the 
metric of the space-time manifold in the region round a generic 
point must be regarded in concrete cases in close connexion with 
the physical phenomena which take place in space and time, 
particularly in the neighbourhood of the point and instant con- 
sidered. The quantitative dependence will be duly established 
in the next chapter. At any rate, in ordinary cases, as has been
seen, we can never go far from a pseudo-Euclidean metric. This
leads to the condition that in the real world of physics the metric
of $V_4$ is to have the same qualitative properties as those belonging
to the pseudo-Euclidean metrics. In particular, the index of
inertia must be 3, which implies (as could be proved) that in every
set of four orthogonal directions drawn from a generic point,
three are spacelike ($ds^2 < 0$) and one is timelike ($ds^2 > 0$).

By “relativity metric” we shall from now onwards mean an
indefinite metric subject to these qualitative restrictions.

In a $V_4$ with a definite metric there is no qualitative distinction
to be made between the various lines in it, while in a relativity
$V_4$, as we have already pointed out, we have at every point
three kinds of direction, according as $ds^2 <$ or $>$ or $= 0$, and,
corresponding to these, three kinds of line: spacelike, timelike,
and lines of zero length. Naturally the classification is much
more complicated for manifolds of two or three dimensions
immersed in a $V_4$ with an indefinite metric; and the same choice
of the variables of reference (which is geometrically equivalent
to the choice of co-ordinate hypersurfaces) would in general
require preliminary close study of the local behaviour from this
point of view.
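The threefold classification of directions is easy to make concrete. The following sketch, with hypothetical names and a pseudo-Euclidean metric chosen only as an example, evaluates $ds^2 = \sum g_{ik}\,dx_i\,dx_k$ for a given set of increments and reports whether the direction is timelike, spacelike or of zero length, with the sign convention used in this section (timelike when $ds^2 > 0$).

```python
def line_element_squared(g, dx):
    """ds^2 = sum over i, k of g[i][k] * dx[i] * dx[k] for a 4x4 metric g."""
    return sum(g[i][k] * dx[i] * dx[k] for i in range(4) for k in range(4))


def classify(g, dx, tol=1e-12):
    """Classify a direction by the sign of ds^2 (timelike when positive)."""
    ds2 = line_element_squared(g, dx)
    if ds2 > tol:
        return "timelike"
    if ds2 < -tol:
        return "spacelike"
    return "zero length"


# Pseudo-Euclidean metric: one positive and three negative squares.
PSEUDO_EUCLIDEAN = [
    [1, 0, 0, 0],
    [0, -1, 0, 0],
    [0, 0, -1, 0],
    [0, 0, 0, -1],
]

print(classify(PSEUDO_EUCLIDEAN, [1, 0, 0, 0]))
print(classify(PSEUDO_EUCLIDEAN, [0, 1, 0, 0]))
print(classify(PSEUDO_EUCLIDEAN, [1, 1, 0, 0]))
```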

We shall avoid any discussion of this kind, and shall impose
some limits on the arbitrariness of the choice of co-ordinates
by taking as a model what happens in the case of a pseudo-Euclidean
$ds^2$ referred to ordinary time $t$ (or a linear function
of $t$) and three space co-ordinates $x_1, x_2, x_3$ which are entirely
arbitrary. Of the four co-ordinate lines one ($x_0$) will then be
timelike and the others spacelike; further, on any hypersurface
$x_0$ = constant we have

$$ \left(ds^2\right)_{x_0 = \text{const.}} = -dl^2, $$

where $dl^2$ is a positive definite differential quadric, so that we can
say that a purely spacelike metric, like that of ordinary geometry,
holds in every timelike section of the space-time. We shall
constantly refer the relativity manifold to co-ordinates $x_0$,
$x_1, x_2, x_3$ for which this qualitative property holds.

Granting these various preliminaries we reach the following 
geodesic principle — derived from the particular cases in sections 
4 and 10 by an obvious generalization — which, in Einstein’s work, 
appears as a fundamental law of the dynamics of a material particle 
in clearly specified physical conditions (i.e. for an assigned $ds^2$):
The world lines of a generic free material particle are identical
with the geodesics of the corresponding $ds^2$, and more precisely
with the timelike geodesics. In other words, these world lines
satisfy the variational equation

$$ \delta \int ds = 0, $$

making at the same time $ds^2 > 0$.

Among the relativity metrics special interest attaches to those 
in which it is possible to choose a system of reference such that 
the ten coefficients $g_{ik}$ shall all be independent of the timelike
parameter $x_0$; metrics of this kind are called stationary (in
relation to the particular system of reference chosen). The 
justification for this name is obvious if it is remembered that in 
physics a phenomenon which takes place in a continuous medium, 
and is determined by a certain number of parameters which are 
functions of position and of the time (e.g. the motion of a fluid) 
is called stationary if these parameters do not depend explicitly 
on the time. 

In particular, a stationary metric will be called statical when 
the coefficients $g_{0i}$ $(i = 1, 2, 3)$ of the three product terms in
$dx_0$ vanish, i.e. when in the expression

$$ L^2 = \frac{ds^2}{dx_0^2} = g_{00} + 2\sum_{i=1}^{3} g_{0i}\,x_i' + \sum_{i,k=1}^{3} g_{ik}\,x_i'\,x_k' \qquad (32) $$

(in which dashes denote differentiation with respect to $x_0$) the
terms of the first degree in $x_i'$ are missing. The justification for
the name is somewhat more indirect, and will appear from the 
following considerations. 

It is known, and can in any case be verified at once, that when
$L$ is an even function of the $x_i'$ (as in the case we are considering)
the Lagrangian equations (4') define a reversible motion, i.e. such
that if $P = P(t)$ represents the motion starting from a certain
initial position $P_0$ with an initial velocity $v_0$, then on changing $t$
into $-t$ (i.e. considering the motion defined by $P = P(-t)$)
we have the solution corresponding to the same initial position
and the same initial velocity but in the reverse direction. Further,
in the classical mechanics it is known that the motion of a particle
is reversible whenever the field of force is invariable with respect
to the time, i.e. when the field is statical (in the ordinary sense of
the word). Hence the application of the term statical to a relativity
metric whose geodesics are reversible with respect to the timelike
variable $x_0$.

In the statical case it is usual to put

$$ g_{00} = V^2, \qquad g_{ik} = -a_{ik}, $$

so that (32) becomes

$$ L^2 = V^2 - \sum_{i,k=1}^{3} a_{ik}\,x_i'\,x_k'; \qquad (32') $$

this coefficient $V^2$ has an important mechanical meaning, which
we shall now explain.

If at a given instant the velocity of the moving point vanishes,
i.e. if each $x_i' = 0$ (case of initial motion starting from rest),
we have in particular from (4') and (32')

$$ \sum_{k=1}^{3} a_{ik}\,x_k'' = -\frac{\partial}{\partial x_i}\left(\tfrac12 V^2\right) \qquad (i = 1, 2, 3) $$

(the two dashes of course denoting double differentiation with
respect to $x_0$); these define the quantities $x_i''$ as functions of
position. The terms on the right,

$$ X_i = -\frac{\partial}{\partial x_i}\left(\tfrac12 V^2\right), $$

being derivatives of a single function, evidently constitute
a covariant system (for any transformations whatever of the
space co-ordinates). Hence the $X_i$’s constitute the covariant
components of a spacelike vector $\mathbf{F} = \operatorname{grad}\left(-\tfrac12 V^2\right)$. The
contravariant components

$$ X^i = \sum_{k=1}^{3} a^{ik} X_k $$

of this vector, from the preceding formulae, are identical with the
initial accelerations. Hence the vector $\mathbf{F}$ obviously provides the
statical measure of the force (per unit mass) of the field (the
initial acceleration of a free material particle, or, if preferred,
the force per unit mass which must be overcome to maintain the
particle at rest).
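As an illustration of the statical measure of the force, the following sketch evaluates the contravariant components $X^i = \sum_k a^{ik} X_k$, with $X_k = -\partial(\tfrac12 V^2)/\partial x_k$, by central differences. Everything specific in it is an assumption made for the example: the function names, the Euclidean choice $a_{ik} = \delta_{ik}$, and the sample coefficient $V^2 = 1 - 2m/r$ with a small constant $m$, for which the result should reduce to the Newtonian value $-m\,x_i/r^3$.

```python
def grad(f, x, h=1e-6):
    """Central-difference gradient of a scalar function f at the point x."""
    out = []
    for i in range(len(x)):
        xp, xm = list(x), list(x)
        xp[i] += h
        xm[i] -= h
        out.append((f(xp) - f(xm)) / (2.0 * h))
    return out


def static_force(V2, a_inv, x):
    """Contravariant components X^i = sum_k a^{ik} X_k, X_k = -d(V^2/2)/dx_k."""
    X_cov = [-0.5 * d for d in grad(V2, x)]
    return [sum(a_inv[i][k] * X_cov[k] for k in range(3)) for i in range(3)]


m = 1e-3  # assumed small constant in the sample coefficient V^2


def V2(x):
    """Sample statical coefficient V^2 = 1 - 2m/r (illustrative only)."""
    r = (x[0] ** 2 + x[1] ** 2 + x[2] ** 2) ** 0.5
    return 1.0 - 2.0 * m / r


IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]  # reciprocal form a^{ik}, Euclidean case

force = static_force(V2, IDENTITY, [2.0, 0.0, 0.0])
print(force)  # first component close to -m / r^2 with r = 2
```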

Consider, beside the point $P$ of co-ordinates $x_i$, a neighbouring
point $P'$ of co-ordinates $x_i + dx_i$ and the (invariant) trinomial

$$ \sum_{i=1}^{3} X_i\,dx_i = -d\left(\tfrac12 V^2\right). $$

Defining, as is natural, the virtual work of $\mathbf{F}$ for the displacement
$PP'$ as the product of the displacement by the orthogonal
projection of the force (just as in ordinary Euclidean
space), the preceding identity shows that $-\tfrac12 V^2$ constitutes
the potential function of the force acting in the field in statical
conditions.

As has been seen just above, in the statical case the force in 
the field can be very simply expressed by means of the single 
coefficient $g_{00} = V^2$. In more general conditions, the whole of
the mechanics of the point is summed up in Einstein’s geodesic 
principle, or, as an alternative form, in the consequent Lagrangian 
equations (4'); an analogous argument can also be developed for 
the initial motion, and the expression for the force in the field 
(at a generic point and instant) as a function of the $g$’s deduced
from it, but the results are by no means so simple and expressive
as in the statical case. To put it briefly, the concepts of mass,
force, and energy are all contained in the four-dimensional metric,
but, at least in general, the task of distinguishing between them
and associating them with the coefficients of $ds^2$ seems to be neither
easy nor fruitful of further results.

13. Versors in a $V_4$ with pseudo-Euclidean metric.

An important fact in connexion with a versor (unit vector)
in the space-time manifold $V_4$ is that it can always be made to
correspond to a vector in three dimensions. This follows from the
fact that it has four parameters (or moments), only three of which
are independent, in virtue of the quadratic identity expressing
that the length of the vector is unity (cf. Chap. V, p. 91). The
interpretation of a vector of this kind as a velocity gives particularly
interesting results.

Let us consider, limiting the case to a pseudo-Euclidean
$V_4$, a generic motion defining $y_1, y_2, y_3$ as functions of $y_0$ and
giving rise to a world line in $V_4$. If, as we shall first suppose,
the velocity of the motion is $< c$, we shall get a timelike line, in
which the corresponding

$$ ds^2 = dy_0^2 - \sum_{i=1}^{3} dy_i^2 = dy_0^2\left(1 - \beta^2\right) $$

is positive. If, on the other hand, the velocity is $> c$ (i.e. $\beta > 1$),
$ds^2$ is negative, and we shall have a spacelike versor (cf. Chap. V,
p. 142). In either case, denoting as usual the components of
the Römerian velocity by $\beta_i$ and the direction cosines of this
velocity by $\alpha_i = \beta_i/\beta$, we obviously get the expressions

$$ \xi^0 = \frac{1}{\sqrt{\left|1 - \beta^2\right|}}, \qquad \xi^i = \frac{\beta_i}{\sqrt{\left|1 - \beta^2\right|}} = \frac{\beta\,\alpha_i}{\sqrt{\left|1 - \beta^2\right|}} \qquad (i = 1, 2, 3) $$

for the parameters of the world line (i.e. of the versor $\xi$ tangential
to it).

Given the three components $\beta_i$ of an ordinary vector $\boldsymbol{\beta}$, these
formulae determine the four parameters $\xi^i$ of a four-dimensional
unit vector (versor) $\xi$, and vice versa. Given $\boldsymbol{\beta}$, the versor (in
the ordinary sense) $\boldsymbol{\alpha}$ belonging to it is of course fixed without
ambiguity (provided $\beta \neq 0$). It will sometimes be convenient
to describe $\boldsymbol{\alpha}$ as the versor reduced from the four-dimensional
versor $\xi$. For $\beta = 0$ the versor $\xi$ has its components $\xi^1, \xi^2, \xi^3$
all zero, and is accordingly called purely timelike. If instead we
consider the case of a very large velocity in a direction $\boldsymbol{\alpha}$ (i.e.
if we make $\beta$ tend to infinity, while the ratios between the $\beta_i$
remain determinate), then we have $\xi^0 = 0$, while the other
components $\xi^i$ reduce to the direction cosines $\alpha_i$ of the reduced
versor. In this case the four-dimensional versor $\xi$ is called purely
spacelike; it is tangential to the three-dimensional manifold
(space) $y_0$ = constant, or rather coincides with the versor $\boldsymbol{\alpha}$
belonging to this manifold.
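The correspondence between a three-dimensional Römerian velocity and a four-dimensional versor can be tried out directly. The sketch below assumes the expressions $\xi^0 = 1/\sqrt{|1-\beta^2|}$ and $\xi^i = \beta_i/\sqrt{|1-\beta^2|}$ for the parameters (a reconstruction of the damaged formulae, with hypothetical function names) and checks that the pseudo-Euclidean square of the length is $+1$ for $\beta < 1$ and $-1$ for $\beta > 1$.

```python
import math


def versor_parameters(beta_vec):
    """Four parameters of the versor belonging to the Roemerian velocity beta_vec."""
    b2 = sum(b * b for b in beta_vec)
    if b2 == 1.0:
        raise ValueError("beta = 1 gives a direction of zero length, not a versor")
    norm = math.sqrt(abs(1.0 - b2))
    return [1.0 / norm] + [b / norm for b in beta_vec]


def pseudo_norm_squared(xi):
    """Square of the length in the pseudo-Euclidean metric (+, -, -, -)."""
    return xi[0] ** 2 - sum(x * x for x in xi[1:])


slow = versor_parameters([0.6, 0.0, 0.0])   # beta < 1: timelike versor
fast = versor_parameters([2.0, 0.0, 0.0])   # beta > 1: spacelike versor
rest = versor_parameters([0.0, 0.0, 0.0])   # beta = 0: purely timelike

print(pseudo_norm_squared(slow), pseudo_norm_squared(fast), rest)
```

For very large $\beta$ the first parameter tends to zero and the others to the direction cosines, in agreement with the purely spacelike limit described above.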

All this can easily be extended to the case of a $V_4$ of any
metric whatever, referred to any co-ordinates $x_0, x_1, x_2, x_3$, the
first timelike and the other three spacelike, and characterized
by the form

$$ ds^2 = dx_0^2\,L^2, $$

where $L^2$ denotes, as in § 12, the expression

$$ \sum_{i,k=0}^{3} g_{ik}\,x_i'\,x_k' $$

and, as usual, $x_i' = \dfrac{dx_i}{dx_0}$.

Given a generic versor of parameters

0, 1, 2, 3) 


x[ ^ P dxi 

L L dV 

14. Digression on geodesics of zero length. 

Let $\tau$ denote a parameter of any kind such that the co-ordinates
$x$ can be considered functions of it, and put

$$ T = \tfrac12\,\frac{ds^2}{d\tau^2} = \tfrac12 \sum_{i,k=0}^{3} g_{ik}\,\dot{x}_i\,\dot{x}_k $$

(dots denoting differentiation with respect to $\tau$). Consider the
equations of motion of a material system as summed up in the
variational equation

$$ \delta \int 2T\,d\tau = 0. \qquad (33) $$


We know from ordinary mechanics that if $\tau$ denotes the time
and $T$ the vis viva of a material system, then the Lagrangian
equations implicit in (33), i.e.

$$ \frac{d}{d\tau}\,\frac{\partial T}{\partial \dot{x}_i} = \frac{\partial T}{\partial x_i} \qquad (i = 0, 1, 2, 3), \qquad (34) $$

define the spontaneous motion of the system and have as a first
integral the equation

$$ T = E = \text{constant}. $$


Whenever the value of the constant $E$ is different from zero,
then by using the equation $T = E$ it is easy to eliminate the
parameter $\tau$ from (33) and obtain from it a variational equation
capable of defining the trajectories. We have in fact, from the
definition of $T$,

$$ \sqrt{2T}\,d\tau = ds, $$

so that the expression for the action, i.e. the integral $\int 2T\,d\tau$
which occurs in (33), can be written

$$ \int 2T\,d\tau = \sqrt{2E}\int \sqrt{2T}\,d\tau = \sqrt{2E}\int ds. $$

Hence, for $E \neq 0$, the variational equation (33), by elimination
of the parameter $\tau$, gives the equation

$$ \delta \int ds = 0, \qquad (35) $$

which is the characteristic equation of the geodesics in the $V_4$
whose line element is $ds$. From this equation we can deduce, as
in § 24, p. 131, the differential equations

$$ \ddot{x}_i + \sum_{j,l=0}^{3} \{jl, i\}\,\dot{x}_j\,\dot{x}_l = 0 \qquad (36) $$

of the geodesics, where dots denote differentiation with respect
to $s$. The same equations would also be obtained, but with $\tau$
instead of $s$, by writing out (34) in full and solving for the $\ddot{x}$’s.

To sum up, for $E \neq 0$, it is a matter of indifference whether we
define the geodesics of $V_4$ as trajectories derived from the variational
equation (33), or by means of the typical property (35).

We now propose to examine separately the case $E = 0$;
since $T = E$ and $ds^2 = 2T\,d\tau^2$ this is equivalent to $ds = 0$
along the whole of the line in question, which therefore in this
case takes the name of geodesic of zero length. (Such lines are of
course real only if $ds^2$ is an indefinite form.) In this case (35)
is no longer suitable for defining geodesics; the method just
referred to and used in § 24, p. 131, to obtain the differential
equations also breaks down, since it assumes $s$ as the independent
variable, and therefore excludes the possibility of $ds$ being identically
$= 0$. The equations (34), however, keep their significance,
and therefore offer a means of defining geodesics of zero length
by a process of passing to the limit (in conditions of complete
analytical regularity) from ordinary geodesics. We shall thus
apply the term “geodesics of zero length” to the lines represented
by solutions of the Lagrangian system (34) for the value zero of
the constant $E$.

The differential equations (36) of ordinary geodesics give
$x_0, x_1, x_2, x_3$ directly as functions of a parameter $\tau$ (or in particular
of $s$). We can suppose the parameter eliminated after integration,
giving, for example, $x_1, x_2, x_3$ as functions of $x_0$. But it is also
possible to eliminate the parameter beforehand, by obtaining
from (35) three differential equations which define $x_1, x_2, x_3$
as functions of $x_0$. To do this, we introduce $x_0$ as the independent
variable in (35), so that it takes the form

$$ \delta \int L\,dx_0 = 0, \qquad (35') $$

where, as in § 12, we have put

$$ L^2 = g_{00} + 2\sum_{i=1}^{3} g_{0i}\,x_i' + \sum_{i,k=1}^{3} g_{ik}\,x_i'\,x_k'. $$

From (35') we deduce, by the ordinary method, the three
required Lagrangian equations, which are

$$ \frac{d}{dx_0}\,\frac{\partial L}{\partial x_i'} = \frac{\partial L}{\partial x_i} \qquad (i = 1, 2, 3). $$

These are completely equivalent to (35'), since, as was seen
in § 2, the fourth equation (to be obtained by making $x_0$ vary
and equating to zero the coefficient of $\delta x_0$) is a necessary
consequence of these three.

These equations, like (35) above, lose their significance in
the case of geodesics of zero length ($E = 0$). An analogous
reduction can be found in this case too, but it is preferable to
follow another method and leave the pure Lagrangian form.
This method is as follows.

We start from (33) instead of (35), and note that from the
definition of $T$ and $L$ we have obviously

$$ 2T = L^2\left(\frac{dx_0}{d\tau}\right)^2. $$

Suppose that $x_0$ is not constant along the geodesic (or arc of
geodesic) of zero length under consideration.¹ In the integral
(33) (corresponding to a generic geodesic of the kind in question)
we can then assume $x_0$ instead of $\tau$ as the independent variable,
so that

$$ \delta \int L^2\,\frac{dx_0}{d\tau}\,dx_0 = 0. $$

The parameter $\tau$ is a function, a priori unknown, of $x_0$ such
that $\dfrac{dx_0}{d\tau}$ remains finite and not zero. We can therefore put

$$ \frac{dx_0}{d\tau} = \lambda(x_0), $$

where $\lambda$ is also finite and not zero. Then the preceding
variational formula becomes

$$ \delta \int L^2\,\lambda\,dx_0 = 0, $$

from which we get for the geodesics the equations

$$ \frac{d}{dx_0}\left(\lambda\,\frac{\partial L^2}{\partial x_i'}\right) = \lambda\,\frac{\partial L^2}{\partial x_i} \qquad (i = 1, 2, 3); $$


expanding and dividing by $\lambda$, these can be written in the form

$$ \frac{d}{dx_0}\,\frac{\partial L^2}{\partial x_i'} - \frac{\partial L^2}{\partial x_i} = -\frac{1}{\lambda}\frac{d\lambda}{dx_0}\,\frac{\partial L^2}{\partial x_i'}. $$

The parameter $\lambda$ can be at once eliminated from these.
Denoting for shortness the Lagrangian binomial on the left-hand
side by $\mathcal{L}_i$, we get finally the two equations

$$ \frac{\mathcal{L}_1}{\partial L^2/\partial x_1'} = \frac{\mathcal{L}_2}{\partial L^2/\partial x_2'} = \frac{\mathcal{L}_3}{\partial L^2/\partial x_3'}, $$

which are to be taken together with the equation

$$ L^2 = 0. $$

¹ From the limitations introduced in § 12, this condition can always be satisfied
in the real field. In fact, if we put $dx_0 = 0$ in $ds^2$ there remains a definite
negative form, which cannot vanish along an actual line, i.e. when $dx_1, dx_2, dx_3$
do not vanish simultaneously.
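It may help to see these equations at work in the simplest case. The following worked example is only an illustration, assuming the pseudo-Euclidean form $L^2 = 1 - \sum x_i'^2$ (with $x_0 = ct$) and writing $\kappa$ for the common value of the three ratios; it shows that the geodesics of zero length are then straight lines described with unit Römerian velocity, i.e. with velocity $c$.

```latex
\begin{align*}
L^2 &= 1 - \sum_{i=1}^{3} x_i'^2, \qquad
\frac{\partial L^2}{\partial x_i'} = -2\,x_i', \qquad
\frac{\partial L^2}{\partial x_i} = 0, \\[4pt]
\frac{d}{dx_0}\frac{\partial L^2}{\partial x_i'} - \frac{\partial L^2}{\partial x_i}
  &= -2\,x_i''
  \quad\Longrightarrow\quad
  x_i'' = \kappa\, x_i' \qquad (i = 1, 2, 3), \\[4pt]
L^2 = 0 \;&\Longrightarrow\; \sum_{i=1}^{3} x_i'^2 = 1
  \;\Longrightarrow\; \sum_{i=1}^{3} x_i'\,x_i'' = 0
  \;\Longrightarrow\; \kappa \sum_{i=1}^{3} x_i'^2 = \kappa = 0, \\[4pt]
x_i'' &= 0, \qquad x_i = x_i^{(0)} + \alpha_i\, x_0, \qquad \sum_{i=1}^{3} \alpha_i^2 = 1 .
\end{align*}
```

Here the equality of the three ratios has been written in the form $x_i'' = \kappa x_i'$ so as to remain valid when one of the $x_i'$ vanishes.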

15. Some elementary theorems of geometrical optics. 

It is known that in a transparent homogeneous medium light 
is propagated in a straight line with constant velocity if no dis- 
turbing influence is at work. In the case of an isotropic medium 
— the only one we shall consider— the velocity is always the same 
in all directions and therefore is a constant characteristic of the 
medium. In vacuo (cf. § 4) the velocity is, in round numbers, 

$$ c = 3 \times 10^{10} \text{ cm./sec.} $$

or 300,000 kilometres per second. 

If instead we have a heterogeneous medium, in which the
refractive index $\mu$ (which is defined as the reciprocal of the velocity
of propagation) varies from point to point, then the rays are in
general not rectilinear but are bent in accordance with a law
which depends on the way in which $\mu$ varies, i.e. on the function
$\mu(x, y, z)$. This law can be put in a compact and useful form
in the following way.¹ If the initial point $P_0$ and the final point
$P_1$ of the path of a ray of light are fixed, the time taken by the
ray to go from $P_0$ to $P_1$ along a line $s$ will obviously be expressed
by the integral

$$ \int_{P_0}^{P_1} \mu\,ds, $$

since $\mu$, as we have just said, is the reciprocal of the velocity.
Now the line actually followed by the light is the one which
makes this integral a minimum, and therefore satisfies the
condition

$$ \delta \int_{P_0}^{P_1} \mu\,ds = 0. $$

This variational equation, which sums up the whole of
geometrical optics, is known as Fermat’s principle.

¹ Cf. for example Levi-Civita and Amaldi: op. cit., Chap. XI, No. 18.
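Fermat's principle can be tried on the simplest heterogeneous case, two homogeneous media separated by a plane. The sketch below is an illustration under assumptions of its own: the values of $\mu$ in the two media, the positions of the end points and the function names are all hypothetical. It minimises the time $\int \mu\,ds$ over the point at which the ray crosses the boundary and checks that the minimising path satisfies the law of refraction, $\mu_1 \sin\theta_1 = \mu_2 \sin\theta_2$.

```python
import math


def travel_time(x, mu1, mu2, a, b, d):
    """Time from (0, a) to (d, -b) for a ray crossing the plane y = 0 at (x, 0).

    mu1 and mu2 are the reciprocals of the velocities above and below the plane.
    """
    return mu1 * math.hypot(x, a) + mu2 * math.hypot(d - x, b)


def minimise(f, lo, hi, iterations=200):
    """Ternary search for the minimum of a convex function on [lo, hi]."""
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)


mu1, mu2 = 1.0, 1.5        # assumed values of mu in the two media
a, b, d = 1.0, 1.0, 1.0    # assumed geometry of the end points

x = minimise(lambda s: travel_time(s, mu1, mu2, a, b, d), 0.0, d)
sin1 = x / math.hypot(x, a)            # sine of the angle of incidence
sin2 = (d - x) / math.hypot(d - x, b)  # sine of the angle of refraction

print(mu1 * sin1, mu2 * sin2)  # the two products should agree
```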


16. Geometrical optics according to Einstein and the meaning
of the constant c.

In constructing a geometrical scheme to represent light rays 
the existence is assumed of an absolute frame of reference, exactly 
as is done in the Newtonian mechanics. In order to help the 
imagination, the system of reference is supposed to be provided 
by a hypothetical medium at rest— the so-called cosmic ether — 
which constitutes as it were a background or support for all 
optical phenomena. In space free from ponderable matter light 
is propagated in a straight line with constant velocity c with 
respect to the ether, or, which is the same thing, with respect to 
fixed axes, where “fixed” axes mean axes at rest with respect
to the ether. Hence c is the velocity of light as it appears to a 
generic observer O, at rest with respect to the ether. 

Consider a solid C moving with velocity u (a pure translatory 
motion) and a pencil of parallel rays of light which are being 
propagated in the same sense as the motion of C. 

With respect to the observer O, the luminous phenomenon 
is diagrammatically represented, as we have just noted, as a 
particular uniform motion with velocity c. 

According to ordinary kinematics, the analogous velocity 
with respect to an observer O' rigidly attached to C is c — u. 

Now within the range of velocities which can be realized by
material bodies the ratio $u/c$ and still more its square $u^2/c^2$ (only
the latter of which can be submitted to effective experimental
control) are small; we can, however, take it as definitely established
that the velocity of propagation is still $c$ with respect
to $O'$ also. This follows from the classical Michelson-Morley
experiment, subsequently repeated by other physicists, and
recently on new bases by Professor Majorana.

In order to explain this experimental result, it is evidently
sufficient that the phenomenon which appears to macroscopic
methods of measurement as the translation of a body $C$ with
velocity $u$ should, with more refined methods of measurement,
be a transformation (A). The study of these transformations has 
in fact shown that any ordinary uniform translation is almost 
indistinguishable from a (A), the difference being of the order 

of one ten-millionth, provided that - < 10”^. 

c 

The classical laws of geometrical optics (that the propagation
of light is rectilinear, uniform, and with velocity $c$), and the
famous experiments referred to above, will therefore still hold
if we suppose that for the propagation of light, as for the motion
of a material particle under no forces, the equation

$$ \delta \int ds = 0 $$

holds, with the condition

$$ ds^2 = 0 $$

(equations of uniform motion with velocity $c$); and if, on the
other hand, we consider the phenomenon of the translation of
solid bodies as very slightly different from the description of
ordinary kinematics, so that it corresponds to a transformation
(A).

Hence these special kinds of motion which correspond to
the propagation of light in the ether, in the absence of disturbing
influences, are dependent on the form

$$ ds^2 = c^2\,dt^2 - dl_0^2, \qquad (37) $$

in which the constant $c$ has a specific numerical value.

For ordinary motion, with velocities which are at most
planetary, and under the action of conservative forces (e.g. in
the presence of assigned masses) the same part is played by the
form

$$ ds^2 = \left(c^2 - 2U\right) dt^2 - dl_0^2, \qquad (37') $$

in which on the one hand the constant $c$ is subject only to the
qualitative restriction of being sufficiently large, and on the
other the influence of the masses modifies to some extent the
coefficient of $dt^2$. If we aim at attaining unity of conception of
physical phenomena, we shall obviously be constrained, ceteris
paribus, to adopt a single differential form $ds^2$ as the determining




form both for the motion of material particles and for the behaviour
of light rays, serving as a basis for both cases. We must therefore
assign to the constant $c$, in the general dynamical case, the same
specific value as belongs to it in the particular optical phenomenon.
In the absence of disturbing influences, in particular of
masses at a perceptible distance, so that $U = 0$, the $ds^2$ of
mechanics then becomes identical with the $ds^2$ of optics (the
limiting case).

Further, since in the case $U = 0$ (i.e. in the absence of
masses at a perceptible distance) the intervention of $ds^2$ has led
to geometrical optics being summarized in two laws which appear
as limiting cases of dynamical laws, we are led to hope for the
extension of the same criterion also to the case in which masses
exist ($U \neq 0$).

The propagation of light will therefore be governed in any
case by the following postulates:

(1) The geodesic principle (as for material motion),

$$ \delta \int ds = 0; \qquad (38) $$

(2) $ds^2 = 0$, which is equivalent to saying that the motions
in question have the square of the velocity, $\dfrac{dl_0^2}{dt^2}$, equal to
$c^2 - 2U$.

The velocity $V$ is thus slightly less than $c$; neglecting terms
which are in fact absolutely negligible, it is given by

$$ V = c\left(1 - \frac{U}{c^2}\right). $$



These two postulates can be summed up in a single illuminating 
geometrical assertion: 

In the metric we have assigned to $V_4$, the world lines of light
are geodesics of zero length.
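To give an idea of the magnitudes involved in the second postulate, the following sketch compares the velocity $\sqrt{c^2 - 2U}$ required by $ds^2 = 0$ with the first-order expression $c\,(1 - U/c^2)$. The numerical values are assumptions made for the example (rounded modern figures for the Sun, in metres and seconds), and the function names are hypothetical.

```python
import math

C = 299_792_458.0  # velocity of light in vacuo, in m/s


def light_velocity_exact(U: float, c: float = C) -> float:
    """Velocity sqrt(c^2 - 2U) required by ds^2 = 0 in the form (c^2 - 2U) dt^2 - dl0^2."""
    return math.sqrt(c**2 - 2.0 * U)


def light_velocity_first_order(U: float, c: float = C) -> float:
    """First-order expression c (1 - U/c^2)."""
    return c * (1.0 - U / c**2)


# Newtonian potential at the surface of the Sun, U = G M / R (rounded values).
U_SUN_SURFACE = 1.327e20 / 6.96e8  # in m^2/s^2

exact = light_velocity_exact(U_SUN_SURFACE)
first_order = light_velocity_first_order(U_SUN_SURFACE)

# Deficit with respect to c, and the error of the first-order expression.
print(C - exact, exact - first_order)
```

With these figures the deficit is a few hundred metres per second, about two parts in a million, while the neglected terms amount to less than a millimetre per second.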

It is to be noted that this assertion has an invariant form, 
and is therefore suitable for defining the behaviour of light rays, 
even if these are referred to a system of any co-ordinates
$x_0, x_1, x_2, x_3$ whatever, instead of to the particular system $t, y_1, y_2, y_3$.

The assertion lends itself to an obvious generalization, since it 

is natural to extend its scope so that it shall continue to hold
even when the $ds^2$ which characterizes the metric of the $V_4$,
though satisfying the qualitative restrictions of § 12, is not
reducible to the particular form (37).

17. Interpretation in geometrical optics of the condition
$ds^2 = 0$.

Given any direction in the four-dimensional space $(t, x_1,
x_2, x_3)$, i.e. any system of increments $(dt, dx_1, dx_2, dx_3)$, we can
obviously make a vector (velocity) $\mathbf{v}$ correspond to it in the
physical space whose line element is given by

$$ dl^2 = \sum_{i,k=1}^{3} a_{ik}\,dx_i\,dx_k, \qquad (39) $$

or more precisely in the Euclidean space tangential to the given
space at the generic point from which the specified increments
are drawn.

We shall take the ratios

$$ \frac{dx_i}{dt} \qquad (i = 1, 2, 3) $$

for the contravariant system of this vector with respect to the
metric (39). Writing these in the form

$$ \frac{dx_i}{dl}\,\frac{dl}{dt}, $$

we see from the presence of the factor $\dfrac{dx_i}{dl}$, which is the direction
parameter, that the positive factor $\dfrac{dl}{dt}$ measures the length of the
vector. Referring back to the equation (39), we have for the square
of this length

$$ v^2 = \frac{dl^2}{dt^2}. $$
Another vector $\mathbf{w}$, a function solely of position and time
(of position alone in stationary conditions), can be made to correspond
to the set of three coefficients $g_{0i}$, which are covariant with
respect to any transformations whatever of the space co-ordinates
alone, by taking these three quantities for the covariant system
of the vector. Then, denoting as usual the coefficients of the form
reciprocal to (39) by $a^{ik}$ and putting

$$ w^2 = \sum_{i,k=1}^{3} a^{ik}\,g_{0i}\,g_{0k}, $$

we get $w$ for the length and (for $w > 0$) the ratios $g_{0i}/w$ for the
moments (the system reciprocal to the parameters) of the
direction of this vector. It is to be noted that if the spacelike
co-ordinates $x$ have the dimensions of a length, the coefficients
$a_{ik}$ of $dl^2$ and therefore their reciprocals $a^{ik}$ are pure numbers,
while the coefficients $g_{0i}$ of the product terms in $dt$ have the
dimensions of a velocity. Hence the vector $\mathbf{w}$, like $\mathbf{v}$, can be
interpreted as a velocity. It will be obvious that this conclusion
still holds even if the dimensions of the co-ordinates $x_1, x_2, x_3$ are
left indeterminate.

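If $\phi$ denotes the angle between $\mathbf{v}$ and $\mathbf{w}$, both for the moment supposed not zero, we have for the metric (39)

$$\cos\phi = \sum_{i=1}^{3} \frac{g_{0i}}{w}\,\frac{1}{v}\,\frac{dx_i}{dt},$$

and therefore identically

$$v\,w\cos\phi = \sum_{i=1}^{3} g_{0i}\,\frac{dx_i}{dt}, \qquad (40)$$

which holds even if $v$ or $w$ vanishes.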

Using (39) and (40), the expression for $ds^2$ can now be written in the form

$$ds^2 = dt^2\,\{V^2 + 2\,v\,w\cos\phi - v^2\},$$

putting $V^2 = g_{00}$.

This makes it evident that the condition $ds^2 = 0$, characteristic of the propagation of light, defines its velocity $v$ as a function of the position and direction of the ray, as well as of the time, in the general case in which the coefficients of $ds^2$, and with them $V$, $w$ and $\phi$, depend on $t$.


Representing the ratios

$$\frac{w}{V}, \qquad \frac{v}{V}$$

(both positive and pure numbers) by $\rho$ and $\beta$, we have for $\beta$ the equation of the second degree

$$\beta^2 - 2\rho\cos\phi\;\beta - 1 = 0; \qquad (41)$$

the product of the roots being $-1$, it follows that one is positive and the other negative. By definition $v$ is necessarily positive, so that it is uniquely determined by (41).

When all the product terms in $dt$ vanish (the statical case), $w = 0$; hence $\beta = 1$, and $v$ coincides with $V$. In general $\rho > 0$, and the difference between $v$ and $V$ (for a specified position and time) depends on the direction of the ray, i.e. on the angle $\phi$ which it makes with $\mathbf{w}$. We also have $v = V$ for every ray perpendicular to $\mathbf{w}$. It is obvious from (41) that the maximum and minimum values of $\beta$ correspond to $\phi = 0$ and $\phi = \pi$. This is equivalent to saying that the maximum velocity of propagation

$$V\left(\sqrt{1 + \rho^2} + \rho\right)$$

is along $\mathbf{w}$, and the minimum velocity

$$V\left(\sqrt{1 + \rho^2} - \rho\right)$$

is in the same direction but in the opposite sense.

Except in the statical case, it will be seen that the propagation 
of light in physical space is not only non-symmetrical for opposite 
senses but is completely irreversible. 
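
The dependence of the velocity of light on direction can be checked numerically. The following is only a minimal sketch: it solves the quadratic (41) for its positive root and compares the result with the extreme values quoted above. The function name `beta` and the sample value of $\rho$ are illustrative assumptions, not taken from the text.

```python
import math

def beta(rho, phi):
    """Positive root of beta^2 - 2*rho*cos(phi)*beta - 1 = 0, i.e. equation (41)."""
    c = rho * math.cos(phi)
    return c + math.sqrt(c * c + 1.0)

rho = 0.3  # hypothetical value of the ratio w/V

beta_max = beta(rho, 0.0)            # ray along w
beta_min = beta(rho, math.pi)        # ray opposite to w
beta_perp = beta(rho, math.pi / 2)   # ray perpendicular to w, expected v = V

print(beta_max, beta_min, beta_perp)
```

The product of the two extreme values is 1, so a ray that is faster than $V$ in one sense is slower than $V$ by the reciprocal factor in the opposite sense.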

18. Fermat’s principle in stationary relativity metrics. 

We saw in § 14 how the difficulty involved in the variational principle $\delta\int ds = 0$ for $ds^2 = 0$ can be evaded in finding the explicit form of the differential equations of the propagation of light. It is not without interest to note that for every stationary $ds^2$ the behaviour of the light rays can also be defined by associating Fermat’s principle of the minimum time with the equation $ds^2 = 0$, i.e. by assuming

$$\delta\int dx_0 = 0, \qquad (42)$$

with the condition that $dx_0$ is to be connected with the space co-ordinates and their differentials by $ds^2 = 0$. Naturally, while in the four-dimensional geodesic principle expressed by (38) not only $\delta x_1$, $\delta x_2$, $\delta x_3$ but also $\delta x_0$ are to be zero at the extremities of the interval of integration, in (42) this condition must not apply to $\delta x_0$, as it would reduce (42) to a mere identity.
We now propose to establish the equivalence, for every stationary metric, of the two principles of geometrical optics: (a) the four-dimensional geodesic principle, and (b) the principle of minimum time.

To do this we must consider the geodesics of zero length as derived by the method of limits from timelike geodesics ($ds^2 > 0$). For the latter we put as usual

$$x_i' = \frac{dx_i}{dx_0} \quad (i = 1, 2, 3), \qquad \frac{dl^2}{dx_0^2} = \sum_{i,k=1}^{3} a_{ik}\, x_i'\, x_k', \qquad L^2 = \frac{ds^2}{dx_0^2} = V^2 + 2\sum_{i=1}^{3} g_{0i}\, x_i' - \sum_{i,k=1}^{3} a_{ik}\, x_i'\, x_k', \qquad (43)$$

where the function $L$ has finite partial derivatives, since $ds^2$, and therefore $L$, is not to vanish.

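The equation (38) can be written

$$\delta\int L\, dx_0 = 0. \qquad (44)$$

Taking the variation with respect to the co-ordinates $x_1$, $x_2$, $x_3$, we get by the classical procedure the Lagrangian equations

$$\frac{d}{dx_0}\frac{\partial L}{\partial x_i'} - \frac{\partial L}{\partial x_i} = 0 \qquad (i = 1, 2, 3); \qquad (45)$$

while the variation with respect to $x_0$ gives

$$\frac{d}{dx_0}\left(\sum_{i=1}^{3} \frac{\partial L}{\partial x_i'}\, x_i' - L\right) + \frac{\partial L}{\partial x_0} = 0,$$

which is a necessary consequence of the equations (45).

On the hypothesis, characteristic of the stationary case, that $L$ does not explicitly contain $x_0$, we get the integral

$$L - \sum_{i=1}^{3} \frac{\partial L}{\partial x_i'}\, x_i' = E, \qquad (46)$$

where the constant $E$ represents the total energy of the moving point.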
Multiplying by $L$, the left-hand side may be written in the form

$$L^2 - \frac{1}{2}\sum_{i=1}^{3} \frac{\partial L^2}{\partial x_i'}\, x_i'.$$

It follows from the third of the equations (43) that $L^2$ is a polynomial of the second degree in $x_1'$, $x_2'$, $x_3'$; it is there already divided up into three homogeneous sets of terms of degree 0, 1, 2 respectively. By Euler’s theorem on homogeneous functions, the quadratic terms disappear from the difference $L^2 - \frac{1}{2}\sum_1^3 \frac{\partial L^2}{\partial x_i'}\, x_i'$, which reduces to $V^2 + \sum_1^3 g_{0i}\, x_i'$. Hence (46) multiplied by $L$ gives

$$V^2 + \sum_{i=1}^{3} g_{0i}\, x_i' = EL.$$

The left-hand side is essentially positive when $L$ tends to zero, being in fact, for $L = 0$, equal to $\frac{1}{2}\left(V^2 + \frac{dl^2}{dx_0^2}\right) \geq \frac{1}{2}V^2$ (which is to be taken as having a lower limit which is not zero in the field considered). The product $EL$ can therefore be considered as a function of the $x$’s and $x'$’s which is always regular, and not zero, when $L$ tends to zero; in the latter hypothesis the constant $E$ obviously tends to infinity.
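
The reduction of $L^2 - \frac{1}{2}\sum_i x_i'\,\partial L^2/\partial x_i'$ to $V^2 + \sum_i g_{0i}\, x_i'$ can be confirmed numerically. The sketch below assumes the form of $L^2$ given by the third of the equations (43); the function names and all numerical values are hypothetical sample data chosen only for illustration.

```python
def L_squared(xp, V, g0, a):
    """L^2 = V^2 + 2*sum g_0i x'_i - sum a_ik x'_i x'_k (third of equations (43))."""
    lin = sum(g0[i] * xp[i] for i in range(3))
    quad = sum(a[i][k] * xp[i] * xp[k] for i in range(3) for k in range(3))
    return V * V + 2.0 * lin - quad

def energy_times_L(xp, V, g0, a, h=1e-6):
    """L^2 - (1/2) sum x'_i dL^2/dx'_i, with derivatives by central differences."""
    total = L_squared(xp, V, g0, a)
    for i in range(3):
        up = list(xp); up[i] += h
        dn = list(xp); dn[i] -= h
        dL2 = (L_squared(up, V, g0, a) - L_squared(dn, V, g0, a)) / (2.0 * h)
        total -= 0.5 * xp[i] * dL2
    return total

# Hypothetical sample data (not from the text)
V = 1.0
g0 = [0.1, -0.2, 0.05]                                     # coefficients g_0i
a = [[1.0, 0.1, 0.0], [0.1, 1.2, 0.0], [0.0, 0.0, 0.9]]    # symmetric a_ik
xp = [0.3, 0.2, -0.4]                                      # the x'_i

numeric = energy_times_L(xp, V, g0, a)
closed_form = V * V + sum(g0[i] * xp[i] for i in range(3))
print(numeric, closed_form)
```

Central differences are exact for a quadratic polynomial apart from rounding, so the two printed numbers should agree to many decimal places.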

Further, for all motions with the same total energy $E$, the principle (44), in which we suppose that $\delta x_0$ vanishes at the extremities of the interval of integration, can be replaced by an analogous one which has the advantage over the first of not requiring this condition to be satisfied. In fact, for $\delta x_0$ zero at the extremities, we have $\delta\int dx_0 = 0$, and in consequence (44) is equivalent to

$$\delta\int (L - E)\, dx_0 = 0$$

or, for $E \neq 0$, to

$$\delta\int \left(1 - \frac{L}{E}\right) dx_0 = 0;$$

and in this last equation we can drop the condition that $\delta x_0$ vanishes at the extremities, since if we transfer the $\delta$ under the integral sign and apply it to $dx_0$ (both explicit, and implicit in the $x'$’s) we get

$$-\frac{1}{E}\int \left(L - \sum_{i=1}^{3} \frac{\partial L}{\partial x_i'}\, x_i' - E\right) d\,\delta x_0,$$

which vanishes in virtue of (46).





It is therefore established that, for an assigned non-zero value of $E$, the equations of motion can be expressed by means of the formula

$$\delta\int \left(1 - \frac{L}{E}\right) dx_0 = 0 \qquad (47)$$

without the necessity of imposing any condition as to $\delta x_0$. The function under the integral sign can be written

$$1 - \frac{L^2}{EL},$$

from which it appears, remembering what was said above about the behaviour of $EL$, that this function is regular and tends to unity if $L$ tends to zero. Now this is precisely the hypothesis which corresponds to the transition from material motion to the limiting case of the propagation of light. Since the function is regular, the order of the operations $\delta\int$ and passage to the limit may be interchanged, so that (47) gives Fermat’s principle

$$\delta\int dx_0 = 0.$$


Fermat’s principle can be put in a purely geometrical form, referred to the spacelike metric with line element $dl$, if we give $dx_0$ the value found from $ds^2 = 0$ in terms of $x_0$, $x_1$, $x_2$, $x_3$, $dx_1$, $dx_2$, $dx_3$, and insert this in the formula just above. The result is particularly easy to interpret in the statical case ($g_{0i} = 0$, $i = 1, 2, 3$), in which we have evidently $dx_0 = \frac{dl}{V}$, and Fermat’s principle takes the form

$$\delta\int \frac{dl}{V} = 0.$$

This shows that the light rays coincide with the geodesics of the three-dimensional space with line element $\frac{dl}{V}$; alternatively, referring to the physical space $dl^2$ and again applying the theorem of least action (cf. § 11), we can say that they coincide with a pencil of trajectories corresponding to the potential $\frac{1}{2V^2}$ and total energy 0.
19. The stress tensor and its divergence in the classical theory.

Let there be given a continuous medium, and in it a surface element (facet) $d\sigma$; one side of this facet is supposed chosen as the positive side, and one sense of the normal direction is associated with it. We shall agree that this sense is the one which corresponds to the passage from the negative to the positive side, and shall denote its versor by $\mathbf{n}$. The resultant of the molecular actions which the particles on the negative side of the element exert on those on the positive side is ordinarily called the stress¹ relative to the positive side of the element considered.² In normal cases (the only ones we propose to consider) this resultant is of the same order of magnitude as $d\sigma$, and is represented by $\boldsymbol{\Phi}_n\, d\sigma$, where $\boldsymbol{\Phi}_n$ is the specific stress on the positive side of the surface element normal to $\mathbf{n}$.

Referred to orthogonal Cartesian axes $y_1$, $y_2$, $y_3$ the three components of the vector $\boldsymbol{\Phi}_n$ will obviously be denoted by $\Phi_{ni}$ ($i = 1, 2, 3$). To characterize the distribution of the stresses at a single point $P$, we introduce the three stresses $\boldsymbol{\Phi}_1$, $\boldsymbol{\Phi}_2$, $\boldsymbol{\Phi}_3$ which act on the facets at $P$ parallel to the co-ordinate planes, or, more precisely, the facets whose normal versors are in the positive directions of the co-ordinate axes. Their components are denoted in order by

$$\begin{matrix} \Phi_{11}, & \Phi_{12}, & \Phi_{13}; \\ \Phi_{21}, & \Phi_{22}, & \Phi_{23}; \\ \Phi_{31}, & \Phi_{32}, & \Phi_{33}; \end{matrix}$$

it follows from the postulates of ordinary mechanics that the matrix formed by these terms is symmetrical, or that

$$\Phi_{32} = \Phi_{23}, \qquad \Phi_{13} = \Phi_{31}, \qquad \Phi_{21} = \Phi_{12},$$

so that there are really six of these quantities $\Phi_{ik}$ ($i, k = 1, 2, 3$).

¹ Cf., for example, A. E. H. Love, Mathematical Theory of Elasticity, third edition, Chap. II; Cambridge University Press, 1920.

² Some authors, Love in particular, invert the respective roles of the two sides of the facet in their definitions, and therefore, by the principle of reaction, change the sense of the vector described as the stress. The sign of its components will be changed accordingly, and the inequality will be inverted which determines whether a given stress is of the nature of a pressure or a pull with respect to the element considered.
Putting $n_i$ for the components of the versor $\mathbf{n}$ (its direction cosines) we get the fundamental formula

$$\boldsymbol{\Phi}_n = \sum_{i=1}^{3} n_i\, \boldsymbol{\Phi}_i, \qquad (48)$$

and hence for the three components in the direction of the co-ordinate axes

$$\Phi_{nk} = \sum_{i=1}^{3} n_i\, \Phi_{ik} \qquad (k = 1, 2, 3).$$

If $\boldsymbol{\xi}$ is a generic direction of direction cosines $\xi_k$, the scalar product $\boldsymbol{\Phi}_n \times \boldsymbol{\xi}$, i.e. the component of the stress $\boldsymbol{\Phi}_n$ along $\boldsymbol{\xi}$, can naturally be written in the form

$$\sum_{i,k=1}^{3} \Phi_{ik}\, n_i\, \xi_k.$$

From the symmetry of the $\Phi_{ik}$’s it follows that $\Phi_{ik}$ in the sum just written down can be replaced by $\Phi_{ki}$; the sum is therefore, by (48), equivalent to the scalar product $\boldsymbol{\Phi}_\xi \times \mathbf{n}$. Hence we have the relation of reciprocity, expressed by the equation

$$\boldsymbol{\Phi}_n \times \boldsymbol{\xi} = \boldsymbol{\Phi}_\xi \times \mathbf{n}.$$

For $\boldsymbol{\xi} = \mathbf{n}$ we have in particular what is called the normal stress, i.e. the component along the normal to the facet of the stress with respect to the facet itself. In accordance with the conventions we have adopted, the stress will be of the nature of a push or a pull according as this normal component is positive or negative. From the remarks above, the necessary criterion is provided by the sign (for $\boldsymbol{\xi} = \mathbf{n}$) of the expression

$$\sum_{i,k=1}^{3} \Phi_{ik}\, n_i\, \xi_k.$$
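
The relation of reciprocity is easy to verify on a concrete case. The following sketch applies formula (48) to a symmetric $3 \times 3$ array of stress components and compares the two scalar products; the helper names and the numerical entries are hypothetical and serve only as an illustration.

```python
def stress_on_facet(Phi, n):
    """Formula (48): component k of the specific stress on the facet of normal n."""
    return [sum(n[i] * Phi[i][k] for i in range(3)) for k in range(3)]

def dot(u, v):
    """Ordinary scalar product in orthogonal Cartesian axes."""
    return sum(u[i] * v[i] for i in range(3))

# Hypothetical symmetric stress components (Phi[i][k] == Phi[k][i])
Phi = [[2.0, 0.5, -1.0],
       [0.5, 1.0, 0.3],
       [-1.0, 0.3, -0.5]]

n = [1.0, 0.0, 0.0]    # versor of the facet normal
xi = [0.0, 0.6, 0.8]   # a second unit direction

lhs = dot(stress_on_facet(Phi, n), xi)   # component along xi of the stress on the facet normal to n
rhs = dot(stress_on_facet(Phi, xi), n)   # component along n of the stress on the facet normal to xi
normal_stress = dot(stress_on_facet(Phi, n), n)  # positive: push; negative: pull

print(lhs, rhs, normal_stress)
```

If the array were not symmetric, the two scalar products would in general differ, which is why the symmetry of the stress components is essential to the argument.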

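To make the notation uniform, we shall write $\boldsymbol{\xi}'$ instead of $\mathbf{n}$. Consider the bilinear form

$$\Phi = \sum_{i,k=1}^{3} \Phi_{ik}\, \xi_i\, \xi_k', \qquad (49)$$

which represents either the component along $\boldsymbol{\xi}$ of the specific stress on the facet normal to $\boldsymbol{\xi}'$, or the component along $\boldsymbol{\xi}'$ of the specific stress on the facet normal to $\boldsymbol{\xi}$.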
If now we replace the $y$’s by any curvilinear co-ordinates $x$ whatever (the geometrical nature of the space characterized by $dl_0^2 = \sum_1^3 dy_i^2$ being of course regarded as invariant), then the parameters $\xi^i$, $\xi'^i$ of the directions $\boldsymbol{\xi}$, $\boldsymbol{\xi}'$ constitute, as we know, two contravariant systems, reducing to the direction cosines in Cartesian co-ordinates, while the scalar quantity $\Phi$ just defined will behave as an invariant on account of its intrinsic meaning. It follows (cf. Chapter IV, p. 70) that the coefficients of the bilinear form $\Phi$ (referred to these parameters as arguments) will constitute a symmetrical covariant double system which is called the stress tensor. Extending the notation adopted in the case of Cartesian co-ordinates we shall denote it by $\Phi_{ik}$. This tensor will of course have the contravariant components $\Phi^{ik}$ and the mixed components $\Phi_i^k$, which can be obtained in the ordinary way by composition with the coefficients of the fundamental form.

The stress tensor depends in general on the position of the point considered; the components $\Phi_{ik}$, referred to generic co-ordinates $x$, can therefore in any case be thought of as functions of the co-ordinates, and therefore as having derivatives (ordinary, covariant, and contravariant). As we saw in Chapter VI, p. 153, from a given double tensor we can always obtain a vector $\mathbf{Y}$ intrinsically related to it, which we called its divergence, and whose covariant components are defined for $n = 3$ by

$$Y_i = \sum_{k=1}^{3} \Phi_{i|k}^{k}, \qquad (50)$$

the index after the bar denoting covariant differentiation.


Now the divergence of the stress tensor has an important mechanical interpretation, which can be found at once by using Cartesian co-ordinates. We know in fact that the molecular forces applied to a given particle by all the surrounding particles have for their resultant a vector $\boldsymbol{\chi}$, whose components per unit volume, in orthogonal Cartesian co-ordinates, are given by

$$\chi_i = -\sum_{k=1}^{3} \frac{\partial \Phi_{ik}}{\partial y_k}. \qquad (51)$$

Noting that in this system of reference the divergence of $\Phi_{ik}$ is expressed by precisely the sum on the right of (51), and remembering that the covariant components of a vector are identical in this case with the ordinary components, we see at once that the vector $\boldsymbol{\chi}$ is the divergence of the stress tensor with its sign changed. Applying the formula (50) we can therefore write

$$\chi_i = -\sum_{k=1}^{3} \Phi_{i|k}^{k}. \qquad (51')$$

20. The fundamental equations of the mechanics of continuous 
systems, referred to fixed axes; transformations of them in general 
co-ordinates (space co-ordinates). 

It is known that, when no hypothesis is made as to the nature of the medium, and when therefore the stresses are not particularized, the fundamental equations of the mechanics of a continuous system reduce to the dynamical equation

$$\rho\,\mathbf{f} = \rho\,\mathbf{F} + \boldsymbol{\chi} \qquad (52)$$

(where $\rho$ is the density, $\mathbf{f}$ the acceleration, $\mathbf{F}$ the force per unit mass, and $\boldsymbol{\chi}$ the vector defined in the preceding section), together with the equation of continuity

$$\frac{\partial \rho}{\partial t} + \operatorname{div}(\rho\,\mathbf{v}) = 0 \qquad (53)$$

($\mathbf{v}$ being the velocity), which can also be written

$$\frac{d\rho}{dt} + \rho\,\operatorname{div}(\mathbf{v}) = 0, \qquad (53')$$

where the symbol $\frac{d}{dt}$ denotes a “proper” derivative, i.e. one which considers $\rho$ as depending on $t$ in such a way that as $t$ varies $\rho$ refers always to one and the same particle of matter.

If now we wish to find the explicit form of these two equations with reference to any co-ordinates $x$ whatever, connected with the $y$’s by formulae which do not involve the time, all we need do is to obtain the expressions for the covariant (or contravariant) components of the vector $\mathbf{f}$, since those of $\boldsymbol{\chi}$ are already known from the preceding section (cf. formula (51')) and the invariant expression of $\operatorname{div}(\rho\,\mathbf{v})$ is known from p. 163, Chapter VI; the force $\mathbf{F}$ will naturally be supposed given by means of its covariant (or contravariant) components.
The acceleration $\mathbf{f}$ is defined by

$$\mathbf{f} = \frac{d\mathbf{v}}{dt},$$

where the (proper) derivative is supposed to be calculated with respect to an observer (system of axes or, more generally, co-ordinate net) fixed in the mechanical sense of the word.

Referred to co-ordinates $y$ this relation is equivalent to the three scalar relations

$$f_i = \frac{dv_i}{dt} = \frac{\partial v_i}{\partial t} + \sum_{k=1}^{3} \frac{\partial v_i}{\partial y_k}\, v_k \qquad (i = 1, 2, 3). \qquad (54)$$


If now, with reference to any co-ordinates $x$ whatever connected with the $y$’s by relations which do not involve the time, we consider the simple system

$$\frac{\partial v_i}{\partial t} + \sum_{k=1}^{3} v_{i|k}\, v^k, \qquad (54')$$

it is easy to see that this is covariant. In fact, on the one hand the quantities $\frac{\partial v_i}{\partial t}$ ($t$ being a parameter not involved in the transformations) are covariant like the $v_i$’s, and on the other the quantities $\sum_1^3 v_{i|k}\, v^k$ are covariant from the law of contraction of tensors. Noting once more that in orthogonal Cartesian co-ordinates the covariant derivatives reduce to the ordinary derivatives, and also the covariant and contravariant components of a vector to the ordinary components, we see that in these co-ordinates the expressions (54') are identical with those on the right of (54), i.e. with the (covariant) components $f_i$ of $\mathbf{f}$. This identity will still hold with reference to the $x$’s, and we can write

$$f_i = \frac{\partial v_i}{\partial t} + \sum_{k=1}^{3} v_{i|k}\, v^k.$$

We can now find the explicit form of the equations (52) and (53) with reference to the co-ordinates $x$. The first will give the covariant equations

$$\rho\, f_i = \rho\, F_i + \chi_i \qquad (i = 1, 2, 3), \qquad (55)$$
and the second the invariant equation

$$\frac{\partial \rho}{\partial t} + \sum_{k=1}^{3} (\rho\, v^k)_{|k} = 0, \qquad (56)$$

or

$$\frac{d\rho}{dt} + \rho \sum_{k=1}^{3} v^k_{|k} = 0. \qquad (56')$$

21. Galilean systems of reference.

Among the purely spacelike transformations a particularly simple group consists of those which give the change from a system of fixed (in the mechanical sense of the word) Cartesian axes to a system of Cartesian axes in uniform translatory motion with respect to the first set; the latter system is called Galilean. The definitions of force, specific stress on a generic surface element, and divergence (whether of a vector or a tensor) are not changed in a transformation of this kind, but the velocity $\mathbf{v}$ of a generic point is altered by the addition of a constant quantity represented by the velocity of translation; this addition, however, evidently does not alter the acceleration (i.e. the proper derivative of $\mathbf{v}$). It follows that such a transformation leaves unchanged the dynamical equation (52), and also the equation of continuity; the latter is evident from the form (53'), which, in addition to $\operatorname{div}(\mathbf{v})$ (which, as just pointed out, is invariant), contains the proper derivative of $\rho$, which from its intrinsic meaning is obviously independent of the axes of reference.

Furthermore, all the laws of the classical mechanics are known 
to be unaltered if the axes of reference are supposed to be in 
uniform translatory motion. 

22. Equivalent form for the system (52) and (53).

In the general equations of motion of a continuous system the force per unit mass $\mathbf{F}$ occurs explicitly. From the formal point of view we can always, and in an infinite number of ways, consider $\mathbf{F}$ as the divergence of a suitable tensor; its components can then be supposed amalgamated with the $\Phi_{ik}$’s, so that we can at once put $\mathbf{F} = 0$ in the equation (52).

From the point of view of application this is not always convenient, and in many cases the direct method is preferable; but from the speculative point of view this process of submerging the force per unit mass in the stress is not only legitimate, but in accordance with the physical standpoint which refuses to admit action at a distance, asserting that every disturbance is transmitted by mediate action. In virtue of these considerations we shall put $\mathbf{F} = 0$ in the vector equation (52).

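We now propose to transform, without altering their content, the three scalar equations included in (52) and the equation of continuity (53), in such a way as to replace these four equations by a set of four substantially identical equations.¹

Referring to orthogonal Cartesian axes $y$, we project the equation (52) (in which we have now put $\mathbf{F} = 0$) on the axis $y_i$; using (55), we get

$$\rho\left(\frac{\partial v_i}{\partial t} + \sum_{k=1}^{3} \frac{\partial v_i}{\partial y_k}\, v_k\right) = -\sum_{k=1}^{3} \frac{\partial \Phi_{ik}}{\partial y_k}, \qquad (57)$$

while the equation of continuity (53) or (53') takes the well-known form

$$\frac{\partial \rho}{\partial t} + \sum_{k=1}^{3} \frac{\partial(\rho\, v_k)}{\partial y_k} = 0. \qquad (58)$$

Adding (58) multiplied by $v_i$ to (57) we get

$$\frac{\partial(\rho\, v_i)}{\partial t} + \sum_{k=1}^{3} \frac{\partial(\rho\, v_i v_k)}{\partial y_k} = -\sum_{k=1}^{3} \frac{\partial \Phi_{ik}}{\partial y_k},$$

which can be written

$$\frac{\partial(\rho\, v_i)}{\partial t} + \sum_{k=1}^{3} \frac{\partial}{\partial y_k}\left(\rho\, v_i v_k + \Phi_{ik}\right) = 0. \qquad (57')$$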


It will now be seen that the quantity on the left of (57') and (58) is in all four cases the sum of partial derivatives with respect to the independent variables $t$, $y_1$, $y_2$, $y_3$. It follows from § 5 that, since $\rho$ denotes the material density, $\epsilon = c^2\rho$ can be interpreted as the energy density; further, it may be seen in a moment that the vector $\rho\,\mathbf{v}$ (the momentum density) represents the flux of matter (per unit of surface and of time), and therefore the flux of energy will be $c^2\rho\,\mathbf{v} = \epsilon\,\mathbf{v}$.

¹ Cf. particularly G. D. Mattioli, Rend. Acc. Lincei, Series V, Vol. XXIII (second half-year, 1914), pp. 328-334, 427-439.
Now to give greater uniformity to the equations (58) and (57'), and to use in them the quantities whose physical interpretation has just been noted, we must replace $t$ and the $v_i$’s by their Römerian expressions $y_0 = ct$, $\beta_i = \frac{v_i}{c}$, and put

$$T_{00} = \epsilon = c^2\rho, \qquad (59)$$

$$T_{0i} = T_{i0} = -\epsilon\,\beta_i = -c\,\rho\, v_i, \qquad (60)$$

$$T_{ik} = \Phi_{ik} + \rho\, v_i v_k \qquad (61)$$

$$(i, k = 1, 2, 3).$$

The result is that the four equations (58) and (57') are all included in the single equation

$$\frac{\partial T_{i0}}{\partial y_0} - \sum_{k=1}^{3} \frac{\partial T_{ik}}{\partial y_k} = 0 \qquad (62)$$

by giving $i$ in turn the values 0, 1, 2, 3.

From the equations (59), (60), (61), we can see the interpretation of the various $T$’s. $T_{00}$ represents the energy density; the $T_{0i}$ ($i = 1, 2, 3$) the components with their sign changed of the relative Römerian flux; the $T_{ik}$ ($i, k = 1, 2, 3$) in statical conditions ($v_i = 0$) reduce to the ordinary stress components, from which they differ in general by the additive terms $\rho\, v_i v_k$ (which, however, in ordinary circumstances are unimportant compared with the other terms). To distinguish when necessary the $T_{ik}$’s from the ordinary stress we shall call them the kinetic stress.


23. Einsteinian modification of the equations of motion of a 
continuous system in a particular case. 

The original equations (52) and (53), and therefore the equivalent set (62), are invariant when the axes of reference undergo an ordinary uniform translation. In the earlier stages of the argument we set out to give the dynamics of a material particle a form which should be invariant for a generic transformation ($T_4$), and we were induced to use Hamilton’s principle in order to modify the equations of motion slightly. It followed from this operation that when there are no external forces the equations so modified keep their algebraic form unaltered, not only for ordinary translations but also for the Lorentz transformations which we studied in detail in § 8.
Now the dynamics of a continuous system must clearly 
include as a limiting case (corresponding to a medium of density 
everywhere zero except in one very small region) the mechanics 
of a single material particle. This at once shows that it is abso- 
lutely necessary that the postulates introduced for the mechanics 
of a continuous system should be brought into harmony with 
the modifications accepted above in the mechanics of the material 
particle. The form of the equations (62), when there are no 
external forces, must therefore remain unchanged for any Lorentz 
transformation. If in accordance with (59), (60), and (61) we take for the $T_{ik}$’s the expressions

$$T_{00} = \epsilon, \qquad T_{0i} = -\epsilon\,\beta_i, \qquad T_{ik} = \epsilon\,\beta_i \beta_k \qquad (i, k = 1, 2, 3), \qquad (63)$$

this condition is not rigorously satisfied, though, as we have just pointed out, there is invariance for ordinary translations; but it is easy to show that the required invariance for Lorentz transformations can be obtained by a modification, which, as usual, is very slight in the conditions ordinarily realized.

To do this, we take the four-dimensional form

$$ds_0^2 = dy_0^2 - dl_0^2$$

used above in discussing the dynamics of a particle, where as usual

$$dl_0^2 = \sum_{i=1}^{3} dy_i^2.$$

Denoting by $dy_i$ ($i = 0, 1, 2, 3$) the increments of the co-ordinates of the generic material element of the system under consideration, and by $dl_0$ and $ds_0$ the corresponding elements of the (spacelike) trajectory and the world line, we have by definition

$$\beta_i = \frac{dy_i}{dy_0} \qquad (i = 1, 2, 3), \qquad (64)$$

whence

$$\beta^2 = \sum_{i=1}^{3} \beta_i^2 = \frac{dl_0^2}{dy_0^2} \qquad (64')$$

and

$$ds_0^2 = (1 - \beta^2)\, dy_0^2. \qquad (64'')$$
The parameters of the world line are

$$\lambda^i = \frac{dy_i}{ds_0} \qquad (i = 0, 1, 2, 3)$$

(where we have suppressed the sign of absolute value, since in dealing with the motion of a material particle we must have $\beta^2 < 1$, or $ds_0^2 > 0$); they can be expressed in terms of the $\beta_i$’s, using (64), (64'), and (64''), in the form

$$\lambda^0 = \frac{dy_0}{ds_0} = \frac{1}{\sqrt{1 - \beta^2}}, \qquad \lambda^i = \frac{dy_i}{ds_0} = \frac{\beta_i}{\sqrt{1 - \beta^2}} \qquad (i = 1, 2, 3).$$

From these, taking account of the general formula

$$\lambda_i = \sum_{k=0}^{3} g_{ik}\, \lambda^k$$

and of the values $g^{(0)}_{ik}$ of the $g_{ik}$ corresponding to $ds_0^2$ (cf. § 6, formula (12)), we get the moments

$$\lambda_0 = \frac{1}{\sqrt{1 - \beta^2}}, \qquad \lambda_i = -\frac{\beta_i}{\sqrt{1 - \beta^2}} \qquad (i = 1, 2, 3).$$

If we take the values of the monomials $\epsilon\,\lambda_i \lambda_k$ as given by these formulae, and compare them with the expressions (63) for the $T_{ik}$’s, we see at once that the difference between each of them and the corresponding $T_{ik}$ is of the second order.

We shall now show that if in the equations (62) we replace the values (63) of the $T_{ik}$’s by the very slightly different values

$$T_{ik} = \epsilon\,\lambda_i \lambda_k \qquad (i, k = 0, 1, 2, 3), \qquad (65)$$

the equations will behave in the required manner for Lorentz transformations; and we shall be able to deduce the criterion to be applied for transforming the equations in the more general case.

Note first that, if $ds_0^2$ is taken as the fundamental form, the terms on the right of (65), and therefore the $T_{ik}$’s, constitute a covariant double system. Further, taking into account once more the particular values $g^{(0)}_{ik}$ of the coefficients of $ds_0^2$ expressed in terms of the co-ordinates $y$, it will be clear that the covariant derivatives of the $T_{ik}$’s are identical with the ordinary derivatives, and that the terms on the left-hand side of (62) can be written in the form

$$\sum_{j,k=0}^{3} g^{(0)\,jk}\, T_{ij|k},$$

and are therefore identical with the covariant components of the divergence of the tensor $T_{ik}$ (cf. Chapter VI, p. 153). These equations therefore collectively express the fact that the divergence of this tensor vanishes, a property which is invariant for any transformations whatever of both the space and the time co-ordinates. Remembering finally that a Lorentz transformation leaves unchanged the form of $ds_0^2$, we can now assert that the equations (62), with the values of $T_{ik}$ given by (65), will still hold after the application of any Lorentz transformation.

Q. E. D.
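
To see how slight the modification is, the classical values (63) and the modified values $\epsilon\,\lambda_i\lambda_k$ can be compared numerically for a small velocity. The sketch below is an illustration under stated assumptions: the function names, the unit energy density and the sample $\beta_i$ are hypothetical choices, and the moments are taken as $\lambda_0 = 1/\sqrt{1-\beta^2}$, $\lambda_i = -\beta_i/\sqrt{1-\beta^2}$.

```python
import math

def T_classical(eps, b):
    """Values (63): T_00 = eps, T_0i = -eps*beta_i, T_ik = eps*beta_i*beta_k."""
    T = [[0.0] * 4 for _ in range(4)]
    T[0][0] = eps
    for i in range(3):
        T[0][i + 1] = T[i + 1][0] = -eps * b[i]
        for k in range(3):
            T[i + 1][k + 1] = eps * b[i] * b[k]
    return T

def T_modified(eps, b):
    """Values (65): T_ik = eps * lambda_i * lambda_k with the covariant moments."""
    b2 = sum(x * x for x in b)
    gamma = 1.0 / math.sqrt(1.0 - b2)
    lam = [gamma] + [-gamma * x for x in b]
    return [[eps * lam[i] * lam[k] for k in range(4)] for i in range(4)]

eps = 1.0                 # hypothetical energy density
b = [1e-3, 2e-3, 0.0]     # hypothetical Roemerian velocity components beta_i
b2 = sum(x * x for x in b)

Tc = T_classical(eps, b)
Tm = T_modified(eps, b)
max_diff = max(abs(Tm[i][k] - Tc[i][k]) for i in range(4) for k in range(4))
print(max_diff, b2)
```

Every component of (65) equals the corresponding component of (63) divided by $1 - \beta^2$, so the largest discrepancy is of the order of $\epsilon\,\beta^2$, in agreement with the statement that the difference is of the second order.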


24. General case. Introduction of the energy tensor, and 
meaning of its components in general co-ordinates. 

When there are no stresses, the result we have arrived at is that we assign to the $T_{ik}$ corresponding to the motion of a generic continuous system the tensor value given by

$$T_{ik} = \epsilon\,\lambda_i \lambda_k,$$

where $\epsilon$ is the energy density and the $\lambda_i$’s are the moments of the world line of the material element. Further, given any distribution of stresses, referred to Cartesian co-ordinates, then in order to transform the equations of motion into any spacelike co-ordinates (leaving the time unchanged) we have traced out an argument based on the invariance of the bilinear form $\Phi = \sum_1^3 \Phi_{ik}\, \xi^i\, \xi'^k$ (which showed us the three-dimensional tensor character of the $\Phi_{ik}$ when we pass to generic co-ordinates), on the vector character of the velocity, and on the invariance of the density.

We now propose to consider more generally transformations ($T_4$) of space and time (i.e. of the set $y_0$, $y_1$, $y_2$, $y_3$ into a new set $x_0$, $x_1$, $x_2$, $x_3$), keeping the results already obtained in the two particular cases just referred to (cf. § 23 and § 20). A sufficient condition is that the $T_{ik}$ (defined physically with reference to a particular system of co-ordinates) shall have the character of a tensor for any transformations whatever. The tensor so introduced is called the energy tensor.

This is equivalent to asserting the invariance of a bilinear form in four variables

$$B = \sum_{i,k=0}^{3} T_{ik}\, \xi^i\, \xi'^k$$

having for its coefficients the quantities $T_{ik}$ and for its arguments the parameters

$$\xi^i = \frac{dx_i}{ds}, \qquad \xi'^i = \frac{d'x_i}{ds'}$$

of two arbitrary four-dimensional versors $\boldsymbol{\xi}$, $\boldsymbol{\xi}'$. It will be seen at once that this postulate covers the two particular cases already discussed. In fact, when there are no stresses the tensor character of the $T_{ik}$ follows from the expressions (65) adopted for them, while for transformations ($T_3$) which leave the timelike co-ordinate unchanged, the invariance of $B$ involves that of the form $\Phi$, as will be seen from the following argument.

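When we pass from a system $x$ to a system $\bar{x}$, it follows from the invariance of $B$ that (with obvious meanings for the notation used)

$$\sum_{i,k=0}^{3} \bar{T}_{ik}\, d\bar{x}_i\, d'\bar{x}_k = \sum_{i,k=0}^{3} T_{ik}\, dx_i\, d'x_k.$$

In the case of a ($T_3$), we shall have $d\bar{x}_0 = dx_0$, $d'\bar{x}_0 = d'x_0$, and therefore

$$\bar{T}_{00}\, dx_0\, d'x_0 + dx_0 \sum_{k=1}^{3} \bar{T}_{0k}\, d'\bar{x}_k + d'x_0 \sum_{i=1}^{3} \bar{T}_{i0}\, d\bar{x}_i + \sum_{i,k=1}^{3} \bar{T}_{ik}\, d\bar{x}_i\, d'\bar{x}_k$$

$$= T_{00}\, dx_0\, d'x_0 + dx_0 \sum_{k=1}^{3} T_{0k}\, d'x_k + d'x_0 \sum_{i=1}^{3} T_{i0}\, dx_i + \sum_{i,k=1}^{3} T_{ik}\, dx_i\, d'x_k;$$

and as this must hold whatever the differentials $dx_0$, $d'x_0$ may be, it follows that

$$\bar{T}_{00} = T_{00}, \qquad \sum_{k=1}^{3} \bar{T}_{0k}\, d'\bar{x}_k = \sum_{k=1}^{3} T_{0k}\, d'x_k, \qquad \sum_{i,k=1}^{3} \bar{T}_{ik}\, d\bar{x}_i\, d'\bar{x}_k = \sum_{i,k=1}^{3} T_{ik}\, dx_i\, d'x_k.$$

As the differentials $dx_i$, $d'x_k$ are arbitrary, these relations express the fact that $T_{00}$ is an invariant (the energy density), that the $T_{0k}$ are the components of a vector (the flux of the energy with its sense changed), and the $T_{ik}$’s those of a covariant double tensor (the kinetic stress).

Q. E. D.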

We now propose to examine, with reference to pseudo-Cartesian co-ordinates $y$, the physical significance of the form $B$ when the directions $\boldsymbol{\xi}$, $\boldsymbol{\xi}'$ are chosen in a particular way.

Suppose first that both the directions are purely timelike, i.e. that

$$dy_i = d'y_i = 0 \qquad (i = 1, 2, 3)$$

and therefore $ds^2 = dy_0^2$, $ds'^2 = d'y_0^2$.

Then the only parameters which are not zero are $\xi^0$ and $\xi'^0$, which are equal to 1, and there remains

$$B = T_{00},$$

i.e. in this case $B$ represents the energy density.

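Now suppose that $\boldsymbol{\xi}$, $\boldsymbol{\xi}'$ are purely spacelike, i.e. that $dy_0 = d'y_0 = 0$; consequently

$$ds^2 = -dl^2, \qquad ds'^2 = -dl'^2.$$

Then there remains

$$B = \sum_{i,k=1}^{3} T_{ik}\, \frac{dy_i}{dl}\, \frac{d'y_k}{dl'},$$

i.e. B reduces to the linear invariant of the kinetic stress.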
Lastly, suppose that $\boldsymbol{\xi}$ is purely spacelike and $\boldsymbol{\xi}'$ purely timelike; i.e. that $dy_0 = 0$, $d'y_i = 0$ ($i = 1, 2, 3$), and therefore

$$ds^2 = -dl^2, \qquad ds'^2 = d'y_0^2.$$

Then

$$B = \sum_{i=1}^{3} T_{i0}\, \frac{dy_i}{dl},$$

and is therefore identical with the flux of the energy in the direction $\boldsymbol{\xi}$ with its sign changed.

We can now determine the physical significance of the $T_{ik}$ with reference to any system of co-ordinates whatever. This follows easily from the invariance of the form $B$ if we allow that the physical significance of this form in the particular cases noted above remains the same in any other system of reference. The different cases are in detail:

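(a) The energy density $\epsilon$ at a generic instant and point will be what $B$ becomes for $\boldsymbol{\xi} = \boldsymbol{\xi}'$ purely timelike, i.e. for

$$\xi^0 = \xi'^0 = \frac{1}{\sqrt{g_{00}}}, \qquad \xi^i = \xi'^i = 0 \quad (i > 0);$$

i.e. it will be

$$\epsilon = \frac{T_{00}}{g_{00}}.$$

(b) The flux of the energy along a specified (spacelike) direction $a$ of parameters $a^i = \frac{dx_i}{dl}$ will be what $-B$ becomes when we put in it

$$\xi^0 = 0, \quad \xi^i = \frac{dx_i}{|ds|} = \frac{dx_i}{dl} = a^i; \qquad \xi'^0 = \frac{1}{\sqrt{g_{00}}}, \quad \xi'^i = 0 \qquad (i = 1, 2, 3),$$

i.e. it will be

$$-\frac{1}{\sqrt{g_{00}}} \sum_{i=1}^{3} T_{i0}\, a^i.$$

If in particular the direction $a$ coincides with one of the co-ordinate directions, say $x_h$, we have

$$a^h = \frac{1}{\sqrt{-g_{hh}}},$$

and the other $a^i$’s are zero; hence the flux of the energy in that direction is given by

$$-\frac{T_{h0}}{\sqrt{-g_{00}\, g_{hh}}}.$$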

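(c) The component in a direction $a$ of the kinetic stress relative to a facet normal to a direction $a'$ will be what $B$ becomes when we put

$$\xi^0 = 0, \quad \xi^i = \frac{dx_i}{|ds|} = \frac{dx_i}{dl} = a^i; \qquad \xi'^0 = 0, \quad \xi'^i = \frac{d'x_i}{|ds'|} = \frac{d'x_i}{dl'} = a'^i \qquad (i = 1, 2, 3);$$

i.e. it will be

$$\sum_{i,k=1}^{3} T_{ik}\, a^i\, a'^k.$$

If in particular the direction $a$ coincides with that of one of the co-ordinate lines, say $x_r$, and $a'$ with that of another, say $x_s$, we shall have

$$a^r = \frac{1}{\sqrt{-g_{rr}}}, \qquad a'^s = \frac{1}{\sqrt{-g_{ss}}},$$

and all the other $a$’s will be zero. Hence the component in the direction $x_r$ of the kinetic stress relative to a facet normal to $x_s$ will be

$$\frac{T_{rs}}{\sqrt{g_{rr}\, g_{ss}}}.$$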

Before concluding this section we wish to make one last remark. We have seen that when there are no stresses (the case of discrete particles of matter) the energy tensor takes the particularly simple form

$$T_{ik} = \epsilon\,\lambda_i \lambda_k. \qquad (65)$$

Another important particular case is when the energy tensor has the form

$$T_{ik} = (\epsilon + p)\,\lambda_i \lambda_k - p\, g_{ik}, \qquad (66)$$

where $p$ is any invariant function of the position and the time. In order to see the physical significance of this expression, consider a specified point of $V_4$, and take a system of co-ordinates which are, at least locally, pseudo-Cartesian, which we know is always possible. Then the $g_{ik}$ take the values $g^{(0)}_{ik}$, while if we make the direction $x_0$ coincide with that of the world line, the $\lambda$’s all become zero, except $\lambda_0$ which is 1.

In these conditions we shall have

$$T_{00} = \epsilon,$$

$$T_{ik} = 0 \qquad (i \neq k),$$

$$T_{ii} = p \qquad (i > 0).$$

The last two formulae tell us that on every facet there is 
exerted a stress normal to it and independent of its direction: 
the scalar quantity p measures the value of this stress per unit of 
surface. The medium under consideration therefore behaves like 
a perfect fluid (a fluid incapable of transmitting a shearing stress), 
and p represents its pressure. It is hardly necessary to point out 
that if p is negative it represents a uniform pull in all directions 
— which, within certain limits, is known to be a possible condition 
even in a real liquid. 
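
The reduction just described can be checked numerically. The sketch below is only an illustration: it assumes that (66) reads $T_{ik} = \varepsilon \lambda_i \lambda_k - p\, g_{ik}$, that the co-ordinates are pseudo-Cartesian with $g = \mathrm{diag}(1, -1, -1, -1)$, and that $x_0$ lies along the world line. The function name and the sample values of $\varepsilon$ and $p$ are hypothetical.

```python
import numpy as np

def energy_tensor(eps, p, lam, g):
    """Covariant components T_ik = eps*lam_i*lam_k - p*g_ik (assumed form of (66))."""
    lam = np.asarray(lam, dtype=float)
    return eps * np.outer(lam, lam) - p * np.asarray(g, dtype=float)

# Assumed local pseudo-Cartesian metric and world-line direction.
g = np.diag([1.0, -1.0, -1.0, -1.0])
lam = np.array([1.0, 0.0, 0.0, 0.0])

T = energy_tensor(eps=5.0, p=2.0, lam=lam, g=g)

# No shearing stress: every off-diagonal component vanishes.
off_diagonal = T - np.diag(np.diag(T))
# The same normal stress p acts on every spatial facet.
spatial_diagonal = np.diag(T)[1:]
```

The vanishing of `off_diagonal` and the equality of the three entries of `spatial_diagonal` are the two properties that identify the medium as a perfect fluid of pressure p.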

25. Relativistic form of the equations of motion of a con- 
tinuous system. 

In the particular case of no forces, we saw in § 23 how the
general equations of motion of a continuous system can be put
in the form

$$\sum_{k=0}^{3} T_{ik}{}^{|k} = 0 \qquad (i = 0, 1, 2, 3), \tag{67}$$

where the $T_{ik}$'s are regarded as elements of a tensor, and that
this equation holds in general co-ordinates x whatever may be
the transformations (involving both space and time) imposed
on the original co-ordinates y. The proof of this consists in the
invariant character of the equations (67) (which express the
vanishing of the divergence of the tensor $T_{ik}$) together with
the fact that in the original co-ordinates y the equations (67)
reduce to the form (62), and that the quantities $T_{ik}$ become
identical (neglecting terms of the second order, if not rigorously)
with the expressions (63) which are their values in the classical
mechanics. All this holds without change even if we drop the
particular hypothesis suggested by the law of transformation of 
the when the transformations (T^) are applied; that the 

forces are zero. It is only necessary to retain the tensorial character
of the $T_{ik}$ in every case, as we have already agreed; which, in
the particular case where stresses are present, means that
their experimental values are determined, say, with reference
to the co-ordinates which formed the starting-point of the
investigation.

The equations (67) thus hold so long as the metric considered
is pseudo-Euclidean, and for any co-ordinates of reference
whatever. But the invariant expression for the laws of motion, which
is seen to hold under this hypothesis, can be at once extended to
the general case of any metric whatever, in virtue of the
observation made earlier in this book (cf. Chapter VI, p. 164) that in a
first-order region every metric behaves as if it had constant
coefficients, and is therefore Euclidean in the proper sense in the
case of a definite $ds^2$ and pseudo-Euclidean in the cases which
concern relativity mechanics (cf. §§ 6 and 12). In fact, the
equations (67) contain only contravariant derivatives of the $T_{ik}$ or,
in other words, combinations of their ordinary first derivatives
with the $g_{ik}$ and their first derivatives; the argument thus does
not go beyond the consideration of a first-order region round the
generic point which is being studied.

26. A particular class of motions of a continuous system. 

In the classical mechanics the equation (52) of the motion 
of a continuous medium, when there are no forces and no mole- 
cular action (a discrete system), evidently reduces to 

f = 0, 

with which is to be associated the equation of continuity. It
follows that the vector equation is satisfied at once by the uniform
rectilinear motion of single particles, the density being then
determined by the equation of continuity. This is conceptually
evident; in order to translate it into a formula, we assign to any
material particle, initially at $P_0$, a velocity $\mathbf{v}(P_0)$ which is a
function (a priori arbitrary) of the position $P_0$; the geometrical
equation of motion is then evidently

$$P(t) = P_0 + \mathbf{v}(P_0)\, t, \tag{68}$$

which shows that the solution depends in substance on three
arbitrary functions of three arguments each.

If we wish to find an explicit expression for the law of variation
of the density it is perhaps preferable to go back to the molecular
equation of continuity instead of to the equation (63) which is
its local form. It is a well-known result that if we introduce the
functional determinant $D$ of the actual co-ordinates $y_i(t)$ with
respect to the initial co-ordinates $y_i^0$ we get

$$\rho\, D = \rho_0,$$

where $\rho_0$ is the initial value of $\rho$, and is a priori arbitrary just
as is the initial distribution of the velocity. Projecting the
equation (68) on the axes, and denoting the components of v by
$v_i$, we get

$$y_i = y_i^0 + v_i\, t,$$

whence

$$D = \left\| \delta_{ik} + \frac{\partial v_i}{\partial y_k^0}\, t \right\| \qquad (i, k = 1, 2, 3).$$

It follows from this that $D$ is a polynomial of the third degree
in $t$, which reduces to unity for $t = 0$. Naturally (supposing that
the $v_i$'s and their first derivatives are finite and continuous) the
motion remains regular so long as $D$ does not vanish; the smallest
positive root (if such exists) of the equation of the third degree
$D = 0$ determines the amplitude of the interval of regularity,
&c.
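
As an illustration of the interval of regularity, the following sketch expands $D(t) = \det(I + tA)$ for a constant matrix $A$ of initial velocity gradients $\partial v_i / \partial y_k^0$ and looks for its smallest positive root. The restriction to a velocity field that is linear in the initial co-ordinates (so that $A$ is the same for every particle) is an assumption made for the example; the function names are hypothetical.

```python
import numpy as np

def jacobian_poly(A):
    """Coefficients [1, c1, c2, c3] of D(t) = det(I + t*A) = 1 + c1*t + c2*t^2 + c3*t^3."""
    A = np.asarray(A, dtype=float)
    c1 = np.trace(A)                          # divergence of the initial velocity
    c2 = 0.5 * (c1 ** 2 - np.trace(A @ A))    # second principal invariant of A
    c3 = np.linalg.det(A)
    return np.array([1.0, c1, c2, c3])

def regularity_interval(A, tol=1e-9):
    """Smallest positive real root of D(t) = 0, or None when D never vanishes for t > 0."""
    coeffs = jacobian_poly(A)
    roots = np.roots(coeffs[::-1])            # np.roots expects the highest power first
    positive = [r.real for r in roots if abs(r.imag) < tol and r.real > tol]
    return min(positive) if positive else None

# Assumed example: particles converge along y_1 and y_2, v = (-y_1, -y_2/2, 0).
A = np.diag([-1.0, -0.5, 0.0])
t_star = regularity_interval(A)
```

For this example $D(t) = (1 - t)(1 - t/2)$, so the motion ceases to be regular when the particles moving along $y_1$ meet.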

A particular case worth noting is when the density remains
constant for each particle (incompressible systems). In this
case $d\rho/dt = 0$, and the equation of continuity, in the original
Eulerian form (53'), gives

$$\operatorname{div} \mathbf{v} = 0. \tag{69}$$

This implies in particular that the divergence vanishes at the
initial instant, and therefore gives as a necessary condition for
the constancy of the density that the field of the initial velocities
must be solenoidal, i.e. that $\operatorname{div} \mathbf{v}(P_0) = 0$. This condition is
not however sufficient. In fact, if the density is to remain constant
it is necessary and sufficient that $D = 1$ at any instant $t$; the
expansion of $D$ as a polynomial of the third degree in $t$ shows
that this imposes three conditions, corresponding to the vanishing
of the coefficients of $t$, $t^2$, $t^3$, and (69) expresses only the first of
these conditions. Further, if these conditions are satisfied initially,
$\rho$ remains constant for every particle (i.e. $d\rho/dt = 0$), which ensures
that the equation (69) is satisfied at every instant, or in other
words that the field of the velocities is always solenoidal.^
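
That the solenoidal condition alone is not sufficient can be seen from two simple initial velocity fields, both with zero divergence. The sketch below is only an illustration under the assumption of velocity fields linear in the initial co-ordinates; the matrices and names are hypothetical.

```python
import numpy as np

def D(A, t):
    """Functional determinant det(I + t*A) for a constant velocity-gradient matrix A."""
    return np.linalg.det(np.eye(3) + t * np.asarray(A, dtype=float))

# Plane strain v = (y_1, -y_2, 0): div v = 0, but the t^2 coefficient does not vanish.
strain = np.diag([1.0, -1.0, 0.0])

# Simple shear v = (y_2, 0, 0): all three coefficients of D vanish.
shear = np.array([[0.0, 1.0, 0.0],
                  [0.0, 0.0, 0.0],
                  [0.0, 0.0, 0.0]])

D_strain = D(strain, 0.5)   # (1 + t)(1 - t) at t = 0.5
D_shear = D(shear, 0.5)     # identically 1
```

In the first case the density changes in spite of the vanishing initial divergence; in the second all three conditions hold and the density of every particle is preserved.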

We have dealt at some length with this class of elementary
solutions, because the results can easily be generalized for any
$V_4$ whatever. If $\lambda_i(x_0, x_1, x_2, x_3)$ denote the moments of a generic
congruence of lines in the $V_4$, we know (cf. Chapter X, p. 274)
that the necessary and sufficient condition for the congruence
to be geodesic is that the curvature vector, or, what is equivalent,
its covariant components, shall vanish, i.e. that

$$\sum_{k=0}^{3} \lambda_{i|k}\, \lambda^k = 0 \qquad (i = 0, 1, 2, 3). \tag{70}$$

We now propose to show that in a $V_4$ with any metric whatever
we get solutions of the equations (67) by taking for world lines
the lines of any geodesic congruence whatever, or, in other words,
by supposing that the $\lambda$'s satisfy the equations (70) and by
assigning a suitable value to the density $\rho$, and through it to the
quantity $\varepsilon$ which appears in the expression (65) for the energy
tensor of a discrete system (i.e. a system with no molecular action).

Take the general equations (67), which we shall write in the
form

$$\sum_{k=0}^{3} T_{ik}{}^{|k} = 0 \qquad (i = 0, 1, 2, 3),$$

and in them give $T_{ik}$ the value $\varepsilon \lambda_i \lambda_k$. We shall have

$$T_{ik}{}^{|k} = \lambda_i\, (\varepsilon \lambda_k)^{|k} + \varepsilon\, \lambda_k\, \lambda_i{}^{|k},$$

and therefore by substitution

$$\lambda_i \sum_{k=0}^{3} (\varepsilon \lambda^k)_{|k} + \varepsilon \sum_{k=0}^{3} \lambda_{i|k}\, \lambda^k = 0 \qquad (i = 0, 1, 2, 3).$$

^ Cf. Cisotti: “Moti di un liquido che lasciano inalterata la distribuzione
locale delle pressioni”, in Rend. della R. Acc. dei Lincei, Series V, Vol. XIX (first
half-year, 1910), pp. 373-376. The observation is there limited to the case of
permanent motion.
The second term vanishes in virtue of (70), and therefore the
four equations reduce to the single condition

$$\sum_{k=0}^{3} (\varepsilon \lambda^k)_{|k} = 0. \tag{71}$$

If we choose $\varepsilon$ so that this condition is satisfied, the
equations of motion will all be satisfied also.

Q. E. D.

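The equation (71) defining $\varepsilon$ can be put in a somewhat more
expressive form by using the results (cf. Chapter X, p. 267) that

$$\sum_{k=0}^{3} \varepsilon_{|k}\, \lambda^k = \sum_{k=0}^{3} \frac{\partial \varepsilon}{\partial x_k}\, \lambda^k = \frac{d\varepsilon}{ds},$$

where $s$ denotes the arc of the world line, and noting that

$$\sum_{k=0}^{3} \lambda^k{}_{|k} = \operatorname{div} \lambda.$$

Hence (71) can be written

$$\frac{d\varepsilon}{ds} + \varepsilon \operatorname{div} \lambda = 0, \tag{71'}$$

which is precisely the form of the equation of continuity.

If in particular we consider a solenoidal geodesic congruence
($\operatorname{div} \lambda = 0$), the last equation becomes

$$\frac{d\varepsilon}{ds} = 0,$$

whence $\varepsilon$ = constant along any world line; i.e. the density of
a particle remains constant throughout the motion.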

27. Experimental determination of the coefficients of an
Einsteinian $ds^2$.

We shall close this chapter by some remarks of a general
character on the experimental determination of the coefficients
$g_{ik}$.

We suppose ourselves fixed in determinate physical conditions,
so that, as already noted in § 10 and § 12, we must also regard
as determinate the Einsteinian

$$ds^2 = \sum_{i,k=0}^{3} g_{ik}\, dx_i\, dx_k \tag{72}$$

of the field which we wish to explore by means of suitable
experiments. It is of course understood that we admit the validity of
the fundamental postulates of general relativity, and more
precisely:

(a) (cf. § 16) the propagation of light always takes place in
such a way that

$$ds^2 = 0 \tag{73}$$

along every world line;

(b) (cf. § 12) the world lines of the motion of a material particle
in a field of force for which $ds^2$ can be expressed by (72) are
timelike geodesics for this $ds^2$.

We propose to show that (a) suffices to determine the ratios
of the coefficients $g_{ik}$, or, which comes to the same thing, gives
$ds^2$ except for a factor which can in turn be found from (b). Of
the four parameters, $x_0$ will as usual denote the time, in the sense
of the conventional time, measured at any single point by a clock
which may be of any kind and even incorrect. However the
timelike parameter is chosen, the mere fact that it is timelike
implies, according to the Einsteinian theory, that $ds^2$ will always
be greater than 0 if $x_0$ alone varies, $x_1, x_2, x_3$ remaining constant.

But, when $dx_1 = dx_2 = dx_3 = 0$, $ds^2$ reduces to $g_{00}\, dx_0^2$, so
that the coefficient $g_{00}$ is necessarily > 0, and we can therefore put

$$g_{00} = c^2 e^{2v}, \tag{74}$$

where $c$ is a positive constant (introduced for the sake of
homogeneity) and $v$, like $g_{00}$, is an unknown function of $x_0, x_1, x_2, x_3$
(a pure number, i.e. of zero dimensions).

We shall now choose any instant $x_0$ we please, and three
values $x_1, x_2, x_3$ of the space co-ordinates, i.e. a point P; we
propose in the first place to determine the ratios of the $g$'s at P
and at the instant $x_0$.

For this we shall use light signals between P and very near
points in the surrounding physical space, which is by hypothesis
(at any given moment) in one-to-one correspondence with the
sets of three co-ordinates $x_1, x_2, x_3$. In consequence, surfaces and
lines in this physical space represented by equations between
$x_1, x_2, x_3$ at the moment $x_0$ are perfectly determinate: in particular
the lines $x_1$ (given by the equations $x_2$ = constant, $x_3$ = constant)
on which only $x_1$ varies, the lines $x_2$, &c.

We shall choose two points Q and Q' very near P, on the same
line $x_1$ as P. Suppose that Q and Q' correspond to increments
(to be treated as infinitesimals) $dx_1$ and $-dx_1$ of the co-ordinate
$x_1$; $dx_2$, $dx_3$ are zero in both cases since the displacement is along
a line $x_1$.

Suppose that two light rays start from P at the instant $x_0$,
one towards Q, the other towards Q'. Let $x_0 + dx_0$ be the
instant when the first ray arrives at Q; $x_0 + d'x_0$ the instant
(not in general the same as the first) when the second ray arrives
at Q'. Using the expression (72) for $ds^2$ and the condition $ds^2 = 0$
for the propagation of light, we shall have in passing from P to Q

$$g_{00}\, dx_0^2 + 2 g_{01}\, dx_0\, dx_1 + g_{11}\, dx_1^2 = 0, \tag{75}$$

and in passing from P to Q'

$$g_{00}\, d'x_0^2 - 2 g_{01}\, d'x_0\, dx_1 + g_{11}\, dx_1^2 = 0. \tag{76}$$

These two equations, in which $dx_1$, $dx_0$, $d'x_0$ are known (the
first chosen as we please, the other two found by experiment),
obviously give the ratios $g_{01}/g_{00}$, $g_{11}/g_{00}$. It is to be noted that if the
elementary times of propagation $dx_0$, $d'x_0$ (found by observation)
are equal, then (75) and (76) give by subtraction $g_{01} = 0$.
Reciprocally, if $g_{01} = 0$, the two intervals of time must be equal.
Hence the elementary propagation of light in the direction of a
line $x_1$ is a reversible phenomenon if and only if $g_{01} = 0$.
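
The elimination just described is elementary and can be written out explicitly. The sketch below is an illustration only: it solves (75) and (76), taken in the form $g_{00}\,dx_0^2 \pm 2 g_{01}\,dx_0\,dx_1 + g_{11}\,dx_1^2 = 0$, for the two ratios, and generates synthetic times of propagation from an assumed metric in order to show that the ratios are recovered. The function names and the numerical values are hypothetical.

```python
import math

def ratios_from_signals(dx1, dx0, dx0_prime):
    """Solve (75)-(76) for r01 = g01/g00 and r11 = g11/g00.

    dx1       : chosen co-ordinate increment towards Q (Q' lies at -dx1)
    dx0       : observed time of propagation from P to Q
    dx0_prime : observed time of propagation from P to Q'
    """
    # Subtracting (76) from (75) and dividing by (dx0 + dx0_prime) isolates r01.
    r01 = (dx0_prime - dx0) / (2.0 * dx1)
    # Substituting back into (75) gives r11.
    r11 = -dx0 * dx0_prime / dx1 ** 2
    return r01, r11

def travel_time(g00, g01, g11, dx1):
    """Positive root dx0 of g00*dx0^2 + 2*g01*dx0*dx1 + g11*dx1^2 = 0 (synthetic data)."""
    disc = (g01 * dx1) ** 2 - g00 * g11 * dx1 ** 2
    return (-g01 * dx1 + math.sqrt(disc)) / g00

# Assumed metric coefficients at P, used only to manufacture the "observations".
g00, g01, g11 = 1.0, 0.1, -1.0
dx1 = 1e-3
dx0 = travel_time(g00, g01, g11, dx1)          # ray towards Q
dx0_prime = travel_time(g00, g01, g11, -dx1)   # ray towards Q'

r01, r11 = ratios_from_signals(dx1, dx0, dx0_prime)
```

The two times of propagation differ precisely because $g_{01} \neq 0$ in the assumed metric; setting `g01 = 0.0` makes them equal, in agreement with the criterion of reversibility.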

In the same way, considering the other two co-ordinate lines
$x_2$ and $x_3$, we can determine the four ratios

$$\frac{g_{02}}{g_{00}}, \quad \frac{g_{22}}{g_{00}}, \quad \frac{g_{03}}{g_{00}}, \quad \frac{g_{33}}{g_{00}}.$$

To obtain the other three ratios

$$\frac{g_{23}}{g_{00}}, \quad \frac{g_{31}}{g_{00}}, \quad \frac{g_{12}}{g_{00}}$$

we must make further experiments of the same type, but with
the point Q in a direction other than those of the co-ordinate
lines.

Thus to determine $g_{23}/g_{00}$ we can use a line on the surface through
P, $x_1$ = constant, which is neither $x_2$ nor $x_3$, e.g.

$$x_3 - x_2 = \text{constant}.$$

We then have, in passing from P to a very near point Q on this
line, the increments

$$0, \quad dx_2, \quad dx_2,$$

with $dx_2$ arbitrary.

If we make a light ray start from P at the instant $x_0$ towards
this point Q, and if $dx_0$ denotes the small time of propagation,
we get from (72) divided by $g_{00}$

$$dx_0^2 + 2\frac{g_{02}}{g_{00}}\, dx_0\, dx_2 + 2\frac{g_{03}}{g_{00}}\, dx_2\, dx_0 + \frac{g_{22}}{g_{00}}\, dx_2^2 + \frac{g_{33}}{g_{00}}\, dx_2^2 + 2\frac{g_{23}}{g_{00}}\, dx_2^2 = 0, \tag{77}$$

whence we get the ratio $g_{23}/g_{00}$, all the other quantities in this equation
being known or already determined. In a similar way we can
find $g_{31}/g_{00}$ and $g_{12}/g_{00}$.
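
The last step can likewise be sketched numerically. The example assumes that (77) has the form $dx_0^2 + 2(\rho_{02} + \rho_{03})\,dx_0\,dx_2 + (\rho_{22} + \rho_{33} + 2\rho_{23})\,dx_2^2 = 0$, with $\rho_{ik} = g_{ik}/g_{00}$ and $dx_3 = dx_2$; the function name and the sample ratios are hypothetical.

```python
import math

def rho23_from_signal(dx0, dx2, rho02, rho03, rho22, rho33):
    """Solve the assumed form of (77) for rho23 = g23/g00 (displacement with dx3 = dx2)."""
    known = dx0 ** 2 + 2.0 * (rho02 + rho03) * dx0 * dx2 + (rho22 + rho33) * dx2 ** 2
    return -known / (2.0 * dx2 ** 2)

# Ratios assumed already determined from signals along the co-ordinate lines.
rho02, rho03, rho22, rho33 = 0.05, -0.02, -1.0, -1.1
rho23_true = 0.2   # value used only to manufacture the observed time of propagation
dx2 = 1e-3

# Synthetic observation: positive root of the light condition along x_3 - x_2 = const.
b = (rho02 + rho03) * dx2
c = (rho22 + rho33 + 2.0 * rho23_true) * dx2 ** 2
dx0 = -b + math.sqrt(b * b - c)

rho23 = rho23_from_signal(dx0, dx2, rho02, rho03, rho22, rho33)
```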

It is not inapposite to add that from other experiments of the
same type we can get any number (in fact an infinite number)
of further equations between the ratios of the $g$'s. The
consistency of these results, in so far as this is borne out by the
further experiments, affords a very significant control of the
validity of the Einsteinian hypothesis so far as concerns the
postulate (a).

The ratios

$$\rho_{ik} = \frac{g_{ik}}{g_{00}} \qquad (i, k = 0, 1, 2, 3) \tag{78}$$

being thus determined, if we put

$$ds^2 = c^2 e^{2v}\, ds'^2 \tag{79}$$

(using (74)), it follows that the individual coefficients of the
differential form

$$ds'^2 = \frac{ds^2}{g_{00}} = \sum_{i,k=0}^{3} \rho_{ik}\, dx_i\, dx_k$$

are all known, and therefore the form itself is completely
determined.

From (72) and (78), separating out the terms which contain
the suffix 0, we get $ds'^2$ in the form

$$ds'^2 = dx_0^2 + 2\, dx_0 \sum_{i=1}^{3} \rho_{0i}\, dx_i + \sum_{i,k=1}^{3} \rho_{ik}\, dx_i\, dx_k. \tag{80}$$

At this point we find that we have to determine the function
$v$ by gravitational experiments, and more precisely by experiments
on the motion of material particles in the field in which the
expression (79) holds for $ds^2$.

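The equations of motion are included in the variational
equation

$$\delta \int ds = 0. \tag{81}$$

Now suppose that the time $x_0$ is taken as the independent
variable along the trajectory. Let $\dot{x}_i$ $(i = 1, 2, 3)$ denote the
derivatives $dx_i/dx_0$, and using (80), put

$$\frac{ds'}{dx_0} = \sqrt{1 + 2\sum_{i=1}^{3} \rho_{0i}\, \dot{x}_i + \sum_{i,k=1}^{3} \rho_{ik}\, \dot{x}_i\, \dot{x}_k} = L(x_0 \mid x_1, x_2, x_3 \mid \dot{x}_1, \dot{x}_2, \dot{x}_3). \tag{82}$$

Then, remembering (79), the variational equation (81) can be
written in the form

$$\delta \int (e^{v} L)\, dx_0 = 0. \tag{81'}$$

This is equivalent to the three Lagrangian equations

$$\frac{d}{dx_0} \frac{\partial (e^{v} L)}{\partial \dot{x}_i} - \frac{\partial (e^{v} L)}{\partial x_i} = 0 \qquad (i = 1, 2, 3).$$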


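Noting that $v$ does not depend on the $\dot{x}$'s, and putting for the
sake of brevity

$$\alpha_i = \frac{d}{dx_0} \frac{\partial L}{\partial \dot{x}_i} - \frac{\partial L}{\partial x_i} \qquad (i = 1, 2, 3), \tag{83}$$

it follows that

$$\frac{dv}{dx_0} \frac{\partial L}{\partial \dot{x}_i} - L\, \frac{\partial v}{\partial x_i} + \alpha_i = 0 \qquad (i = 1, 2, 3). \tag{84}$$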


It is further to be noted that direct observation of the motion
enables us to determine how the co-ordinates vary as functions
of the time $x_0$, so that we must consider the functions $x_i(x_0)$,
and therefore also the derivatives $\dot{x}_i$ and $\ddot{x}_i$, known for every
material particle left to itself in or projected into the field of force
we are considering. It follows that the quantities $\alpha_i$ defined
by (83) are also known. Since

$$\frac{dv}{dx_0} = \frac{\partial v}{\partial x_0} + \sum_{i=1}^{3} \frac{\partial v}{\partial x_i}\, \dot{x}_i,$$

it follows that ultimately the equations (84) are three linear
equations in the four partial derivatives of the unknown function
$v$. If we fix a generic point P and an instant $x_0$, any arbitrary
choice of the velocity of the body under experiment (i.e. of the
three numerical values to be assigned to $\dot{x}_1, \dot{x}_2, \dot{x}_3$) will give three
equations in the four derivatives

$$\frac{\partial v}{\partial x_0}, \quad \frac{\partial v}{\partial x_1}, \quad \frac{\partial v}{\partial x_2}, \quad \frac{\partial v}{\partial x_3}$$

referred to the given position and time. The equations are
therefore more than sufficient to determine the numerical values of
these derivatives, in the sense that by making a larger number
of experiments we can not only determine the four unknowns, but
also test the accuracy of the results as many times over as we
wish.
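
A minimal numerical sketch of this over-determination follows. It assumes that each experiment yields three equations of the form $(\partial v/\partial x_0 + \sum_k \dot{x}_k\, \partial v/\partial x_k)\, p_i - L\, \partial v/\partial x_i + \alpha_i = 0$, obtained by expanding the Lagrangian equations of (81'), with $p_i = \partial L/\partial \dot{x}_i$. The symbols $p_i$ and $\alpha_i$, the function names and the synthetic data are assumptions introduced for the illustration; several experiments are stacked and solved by least squares.

```python
import numpy as np

def rows(xdot, p, L, alpha):
    """Three linear equations M @ w = b in w = (dv/dx0, dv/dx1, dv/dx2, dv/dx3)."""
    xdot, p, alpha = (np.asarray(a, dtype=float) for a in (xdot, p, alpha))
    M = np.empty((3, 4))
    M[:, 0] = p                                   # coefficient of dv/dx0
    M[:, 1:] = np.outer(p, xdot) - L * np.eye(3)  # coefficients of dv/dx1..dv/dx3
    return M, -alpha

def solve_grad_v(trials):
    """Least-squares estimate of the four derivatives from any number of experiments."""
    blocks = [rows(*trial) for trial in trials]
    M = np.vstack([m for m, _ in blocks])
    b = np.concatenate([rhs for _, rhs in blocks])
    solution, *_ = np.linalg.lstsq(M, b, rcond=None)
    return solution

# Synthetic experiments: pick the derivatives, then manufacture consistent alpha's.
rng = np.random.default_rng(0)
grad_v_true = np.array([0.1, -0.2, 0.3, 0.05])
trials = []
for _ in range(3):
    xdot = 0.1 * rng.normal(size=3)
    p = rng.normal(size=3)
    L = 1.0
    M, _ = rows(xdot, p, L, np.zeros(3))
    trials.append((xdot, p, L, -(M @ grad_v_true)))

grad_v = solve_grad_v(trials)
```

A single experiment gives only three equations for four unknowns; two or more experiments with different velocities generally fix all four, and the residual of the least-squares fit serves as the test of accuracy mentioned in the text.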

The derivatives of $v$, at every point in a certain field and at
every instant in a certain interval, being known, $v$ itself is
determined except for an additive constant; hence, from (74),
$g_{00}$ is known except for a constant multiplier, which we may
suppose absorbed into the factor of homogeneity $c^2$, so that $c$
remains arbitrary. The presence of this constant in the expression
for $g_{00}$, and hence, by (79), in $ds^2$, seems to be in the nature of
things, corresponding in substance to the choice, which remains
arbitrary, of the unit chosen to measure $ds^2$, the space-time
interval.
CHAPTER XII

The Gravitational Equations and General Relativity


1. Qualitative properties of the coefficients of $ds^2$.


It follows from the results in the preceding chapter (p. 325)
that when the variables of reference $y_0, y_1, y_2, y_3$ are such that
they can be interpreted, without sensible error, the first as
absolute time, and the others as Cartesian co-ordinates, then
the coefficients $g_{ik}$ of the Einsteinian $ds^2$ of space-time, in
conditions corresponding to the motion of the celestial bodies (in
particular, of the bodies forming the planetary system), differ
by very little from the pseudo-Euclidean values, the difference
being of at least the second order, in the sense explained above.
More precisely we can say that:

(a) The coefficient $g_{00}$ differs from $1 - \dfrac{2U}{c^2}$ by terms of order
higher than the second (cf. p. 320 in the preceding chapter),
where $U$ represents the ordinary Newtonian potential of the
field considered.

(b) The coefficients $g_{0i}$ $(i > 0)$ are of order higher than the
second. If in fact they were only of the second order, it follows
from p. 339 in the preceding chapter that the difference between
the velocities of propagation of light in the various directions
round a point would also have to be of the second order;
this, however, is physically inadmissible, as a difference of
this magnitude could be detected by means of optical
experiments.

(c) The other coefficients $g_{ik}$ $(i, k > 0)$ differ from $-\delta_{ik}$ by
terms of the second or higher order.

Now let us consider the absolute motion of a generic material
particle P, e.g. a small planet. Let $u(P, P')$ be the Newtonian
potential of the attraction exerted on it by any particle P' of
the other attracting bodies, which we shall suppose to be of fairly
large mass compared with P, as is in fact the case in the typical
examples offered by astronomy. The disturbing effects of P on
the motion of P' being supposed negligible, the dependence of
$u$ on the space co-ordinates $y_1, y_2, y_3$ involves the co-ordinates
of P, while its dependence on the Römerian time $y_0$ involves
the co-ordinates of the attracting body P'. If $\delta P$ is a generic
displacement (of components $\delta y_1, \delta y_2, \delta y_3$) of the point P, and

$$\delta u = \sum_{i=1}^{3} \frac{\partial u}{\partial y_i}\, \delta y_i$$

is the corresponding increment of $u$, we have

$$\delta u = \mathbf{F} \times \delta P,$$

where $\mathbf{F}$ is the force exerted on P by P'. Further, if we consider
a small interval of time $dy_0$ and denote by $dP'$ the displacement
of P' during that interval, and by $du$ the increment of $u$, we get
similarly, applying the principle of reaction,

$$du = -\mathbf{F} \times dP'.$$


After this it is easy to determine the order of magnitude of
the timelike derivative of $u$ in relation to the spacelike derivatives.
In fact, from the first formula, putting $\delta P = \mathbf{n}\, \delta l$ (where $\mathbf{n}$ is
the versor of a generic direction) we get the well-known result
that the derivative of $u$ in this direction has the value $\mathbf{F} \times \mathbf{n}$,
and is therefore of the same order of magnitude as the intensity
$F$ of the force; while from the second formula, on dividing by
$dy_0$, it follows that

$$\frac{\partial u}{\partial y_0} = -\mathbf{F} \times \frac{dP'}{dy_0} = -\mathbf{F} \times \frac{1}{c} \frac{dP'}{dt},$$

which shows that the order of magnitude of this derivative is
that of $\beta F$ (with the usual meaning of $\beta$). Hence, in the supposed
conditions, the timelike derivative of $u$ is of the first order in
relation to the spacelike derivatives. The same result holds
without change for $g_{00}$, which, as we have just said, is $1 - \dfrac{2U}{c^2}$
(neglecting terms of order higher than the second), $U$ being a
sum of terms of the type just considered.

Taking the case of $g_{00}$ as typical, we shall assume, in ordinary
astronomical conditions, that:

(d) The derivatives of the coefficients $g_{ik}$ with respect to $y_0$
are of higher order by at least one unit than the analogous
derivatives with respect to the other $y$'s.

We can sum up all this in the statement that if we are content
with approximate results (meaning that we stop short at terms
of the second order), everything happens as if the coefficients
$g_{0i}$ were zero, and the other $g_{ik}$ independent of $y_0$. This is
equivalent to the statement that, to the given order of
approximation and in ordinary astronomical conditions, every $ds^2$ behaves
as if it were statical (cf. Chapter XI, p. 326).

2. The tensor $G_{ik}$ and its divergence. The gravitational tensor.

We have already noted (cf. Chapter VII, p. 200) that for any
$V_n$ whatever we can construct from the Riemannian tensor the
symmetrical double tensor

$$G_{ik} = \sum_{j,h=1}^{n} a^{jh}\, (ij, hk), \tag{1}$$

and its linear invariant

$$G = \sum_{i,k=1}^{n} a^{ik}\, G_{ik}. \tag{2}$$

This definition naturally holds also for an indefinite metric:
in particular therefore for the $ds^2$ of relativity ($n = 4$), in which
case the tensor under discussion is called the Einstein tensor;
its components are

$$G_{ik} = \sum_{j,h=0}^{3} g^{jh}\, (ij, hk), \tag{1'}$$

and its linear invariant therefore takes the form

$$G = \sum_{i,k=0}^{3} g^{ik}\, G_{ik} = \sum_{i,j,h,k=0}^{3} g^{ik} g^{jh}\, (ij, hk). \tag{2'}$$

We may note incidentally that for a $V_2$ the tensor $G_{ik}$ is
related to the fundamental tensor $a_{ik}$ and to the Gaussian
curvature $K$ by the formula ^

$$G_{ik} = -K a_{ik} \qquad (i, k = 1, 2); \tag{3}$$

^ In fact, for $n = 2$, it follows from the definitions of $K$ (p. 194, formula
(28)) and of the $\varepsilon$-systems (Chap. VI, p. 158) that $(ij, hk) = K \varepsilon_{ij} \varepsilon_{hk}$, as can
at once be verified, remembering that the symbol $(ij, hk)$ either reduces to
$\pm(12, 12) = \pm Ka$, or vanishes. Further, with the same definition of $\varepsilon$, we have
also the identities $\sum_{j,h=1}^{2} a^{jh} \varepsilon_{ij} \varepsilon_{hk} = -a_{ik}$. Replacing $(ij, hk)$ in the formula of
type (1') by $K \varepsilon_{ij} \varepsilon_{hk}$ and using this identity, we get (3).
while for a $V_3$ the quantities $G_{ik}$ reduce to Ricci's symbols (cf.
Chapter VII, p. 199)

$$\alpha^{ik} = \frac{(i{+}1\ i{+}2,\ k{+}1\ k{+}2)}{a},$$

the relation being

$$G_{ik} = \alpha_{ik} - \mathfrak{M}\, a_{ik}, \tag{4}$$

where $\mathfrak{M}$ denotes the mean curvature of the $V_3$, or in symbols

$$\mathfrak{M} = \sum_{i,k=1}^{3} a_{ik}\, \alpha^{ik}. \tag{5}$$


For $n = 4$, from the general formula

$$\frac{n^2 (n^2 - 1)}{12}$$

of Chapter VII, p. 182, it follows that in general the
Riemann-Christoffel tensor has 20 algebraically independent components,
while the elements $G_{ik}$ of the Einstein tensor provide only 10
linear combinations. This simple arithmetical remark shows that
the Einstein tensor cannot exhaust all the curvature properties
of the $V_4$, but, as we shall see, it does suffice to give those of
essential physical importance.
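
The arithmetic of this remark is easily tabulated. The short sketch below simply evaluates $n^2(n^2 - 1)/12$ and the number $n(n+1)/2$ of components of a symmetrical double tensor for small $n$; the function names are hypothetical.

```python
def riemann_components(n):
    """Number of algebraically independent components of the Riemann tensor of a V_n."""
    return n * n * (n * n - 1) // 12

def symmetric_components(n):
    """Number of independent components of a symmetrical double tensor in n dimensions."""
    return n * (n + 1) // 2

# (n, Riemann components, components of a symmetrical double tensor) for n = 2, 3, 4.
table = [(n, riemann_components(n), symmetric_components(n)) for n in (2, 3, 4)]
```

For $n = 3$ the two counts coincide (6 and 6), which agrees with the fact that in a $V_3$ the curvature is fully expressed by a symmetrical double system; for $n = 4$ they are 20 and 10.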

Before beginning the examination of this question, we shall
find the expression for the divergence of the tensor $G_{ik}$. From
(1'), we have by covariant differentiation

$$G_{ik|l} = \sum_{j,h=0}^{3} g^{jh}\, (ij, hk)_l,$$

so that the components of the divergence (cf. Chapter VI, p. 153)

$$Y_i = \sum_{k,l=0}^{3} g^{kl}\, G_{ik|l} \tag{6}$$

become

$$Y_i = \sum_{j,h,k,l=0}^{3} g^{jh} g^{kl}\, (ij, hk)_l.$$

In virtue of the relations

$$(ij, hk) = (hk, ij),$$

Bianchi's identities (formula (17'), Chap. VII, p. 183) enable us
to substitute

$$-(jl, hk)_i - (li, hk)_j$$

for $(ij, hk)_l$, so that we have

$$Y_i = -\sum_{j,h,k,l=0}^{3} g^{jh} g^{kl}\, (jl, hk)_i - \sum_{j,h,k,l=0}^{3} g^{jh} g^{kl}\, (li, hk)_j.$$

The first term is merely $G_i$ (the derivative of $G$ with respect to
$x_i$), as follows from covariant differentiation of (2'), which by
interchanging the indices can be written in the form

$$G = -\sum_{j,h,k,l=0}^{3} g^{jh} g^{kl}\, (jl, hk).$$

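Interchanging $j$ and $l$, and also $h$ and $k$, in the second term,
it becomes

$$-\sum_{j,h,k,l=0}^{3} g^{lk} g^{hj}\, (ji, kh)_l,$$

and in view of the identity

$$(ji, kh) = (ij, hk)$$

it obviously reduces to $-Y_i$. We therefore have

$$Y_i = \tfrac{1}{2} G_i, \tag{7}$$

which in virtue of (6) can also be written

$$\sum_{k,l=0}^{3} g^{kl}\, G_{ik|l} - \tfrac{1}{2} G_i = 0. \tag{7'}$$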

0 

Since the divergence of the tensor $G g_{ik}$ (proportional to the
fundamental tensor $g_{ik}$) is

$$\sum_{k,l=0}^{3} g^{kl}\, (G g_{ik})_{|l} = \sum_{k,l=0}^{3} g^{kl} g_{ik}\, G_l = G_i,$$

it will be seen that (7), or the equivalent equation (7'), expresses
the property that the divergence of the tensor

$$G_{ik} - \tfrac{1}{2} G g_{ik}$$

is zero. This tensor is called the gravitational tensor; the name will
be justified farther on.
3. Solidarity of physical phenomena. Criteria for the
construction of the gravitational equations, and reduction of the
inductive proof of their validity to the statical case.

In the immediate vicinity of a point and instant fixed in
advance, a mechanical phenomenon is completely determined
(at least conceptually) if we know, at the specified point and
instant, the density and velocity of the matter (or, which comes
to the same thing, of the energy), and the distribution of the
specific stress, which includes as a differential consequence the
determination of the external force; the latter, however, as
already noted (p. 349) in the preceding chapter, can be supposed
absorbed into the stresses, the concept of action at a distance
being as before excluded. In substance, therefore, the local
behaviour of a mechanical phenomenon is completely determined
by the knowledge (which is both necessary and sufficient) of the
energy tensor $T_{ik}$.

This remark has a more general scope, since it holds also
for phenomena other than mechanical (e.g. electromagnetic
phenomena).

Einstein’s fundamental view is that the aggregate of physical
phenomena influences the metric of $V_4$; more precisely, that at
every point P of the $V_4$ there must be a local relation between
the value of the energy tensor, which may be taken as
characteristic of the physical conditions, and the behaviour of the
curvatures of the $V_4$ at the point. As an abstract hypothesis,
the possibility of some such influence, limited however to the
spatial metric, had already been suggested independently by
Riemann and by Clifford. Einstein completed it, applying it
not only to the spatial metric, but to the metric of the
space-time which includes both space and time and also, as we saw
in § 4, p. 291, and § 10, p. 320, when studying the motion of a
material particle, the force in the field, which is represented
through the coefficient $g_{00}$.

We have pointed out just above that from the mathematical
point of view the external force can be considered as produced
by a suitable distribution of stresses. From the point of view
of the classical mechanics this principle could also be applied to
the particular case of forces of gravitational origin; Einstein,
however, assigns a privileged position to these forces, and
supposes that all actions of gravitational origin (and only these)
are so intimately fused with the geometrical and temporal
properties that they are directly determined by the
four-dimensional $ds^2$. Such a possibility is amply justified by the
considerations set forth in the preceding chapter (pp. 291-328). All the
other non-gravitational forces (in particular, actions of
electromagnetic origin), on the contrary, can be absorbed into the
energy tensor. In order to put this view in a mathematical form,
Einstein had to establish a relation between the $ds^2$ (i.e. its ten
coefficients) and the energy tensor (i.e. the ten functions $T_{ik}$);
he had therefore to determine ten equations. One of these was
a necessary consequence, at least approximately, of the
Newtonian theory. In the classical mechanics space is considered
rigorously Euclidean, and by Newton’s law the density $\rho$ of the
attracting matter determines the field of force by means of the
Newtonian potential

$$U = f \int \frac{\rho\, dS}{r},$$

where $f$ is the gravitation constant, and the meaning of the other
symbols is as usual. From this expression for $U$ Poisson’s
equation

$$\Delta_2 U = -4\pi f \rho$$

follows in the ordinary way for every point of the field. Since
the density $\rho$ differs from the element $T_{00}$ of the energy tensor
only by a constant multiplier (pp. 349, 354, §§ 22, 24, and 25),
while to a first approximation (cf. § 4, p. 291) we have

$$g_{00} = 1 - \frac{2U}{c^2},$$

it follows that Poisson’s equation establishes a relation between
the energy tensor and a sum of second derivatives of $g_{00}$.

The differential equations expressing the relation between
the coefficients of $ds^2$ and the quantities $T_{ik}$ must therefore
include this relation, at least to a first approximation. A
reasonable induction suggests that in order to construct the ten required
equations we must equate the ten components $T_{ik}$ of the energy
tensor to ten differential expressions of the second order in the
coefficients $g_{ik}$, which, the system being invariant, must
themselves constitute a tensor. Now a double tensor of the second
order is given by those combinations of the Riemann-Christoffel
tensor which we considered in the preceding section.
Accordingly, the procedure which would first occur to one would be to
assume that the $G_{ik}$'s were equal or proportional to the $T_{ik}$'s,
and this was in fact what at his first attempt Einstein did. But
immediately afterwards he reflected that the fundamental
equations must not impose on the metric properties of space-time any
a priori limitation, in this sense that any value whatever of $ds^2$
must be capable of being regarded as theoretically possible
provided there is a suitable energy tensor. This property would be
inconsistent with the condition that the $G_{ik}$'s and $T_{ik}$'s are to
be proportional, since the latter tensor, from its physical origin,
satisfies four differential conditions expressing the vanishing of
its divergence (cf. pp. 351, 359, §§ 23 and 25), so that the $G_{ik}$'s
would have to be connected by corresponding equations. The
idea of a linear relation between the two tensors can however be
retained without imposing any differential relation on the $g_{ik}$'s,
since the divergence of the tensor

$$G_{ik} - \tfrac{1}{2} G g_{ik}$$

is identically zero, as we saw in the preceding section. If in fact
we put

$$G_{ik} - \tfrac{1}{2} G g_{ik} = -\kappa\, T_{ik}, \tag{8}$$

where $\kappa$ denotes a constant (to be subsequently connected with
the constant $f$ in Poisson’s equation), there will be no resulting
differential relations between the $g_{ik}$'s. These are the celebrated
gravitational equations. The foregoing considerations serve
merely to give them plausibility from the purely formal point
of view; their physical justification follows a posteriori from
arguments of two kinds, which we shall now explain.

For the moment we consider only a first approximation; i.e.
we suppose that $ds^2$ differs from the pseudo-Euclidean value by
a small amount. As we saw in Chapter XI, p. 320, we may on
this hypothesis assume

$$g_{00} = 1 - 2\gamma,$$

$$g_{0i} = -\gamma_i \qquad (i = 1, 2, 3),$$

$$g_{ik} = -\delta_{ik} + \gamma_{ik},$$

where the $\gamma$'s are small quantities of the second order. We also
saw that (still with the same hypothesis) the equations of motion
of a material particle, to a first approximation, depend neither
on the $\gamma_i$'s nor on the $\gamma_{ik}$'s, but only on the coefficient $g_{00}$ or,
which is the same thing, on the function $\gamma$, and that they in fact
reduce to the classical Newtonian equations

$$\ddot{x}_i = \frac{\partial U}{\partial x_i} \qquad (i = 1, 2, 3),$$

since

$$\gamma = \frac{U}{c^2}. \tag{9}$$
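
To give an idea of what "small quantities of the second order" means in ordinary astronomical conditions, the sketch below compares $\gamma = U/c^2$ for the field of the Sun at the distance of the Earth with $\beta^2$, the square of the ratio of the Earth's orbital velocity to that of light. The numerical constants are rounded modern values supplied for the illustration; they are not taken from the text.

```python
# Rounded physical constants in SI units (assumed values, for illustration only).
F_GRAVITATION = 6.674e-11   # gravitation constant f
M_SUN = 1.989e30            # mass of the Sun, kg
R_ORBIT = 1.496e11          # mean Sun-Earth distance, m
C_LIGHT = 2.998e8           # velocity of light, m/s
V_EARTH = 2.978e4           # mean orbital velocity of the Earth, m/s

U = F_GRAVITATION * M_SUN / R_ORBIT   # Newtonian potential of the Sun at the Earth
gamma = U / C_LIGHT ** 2              # the function gamma of (9)
beta_squared = (V_EARTH / C_LIGHT) ** 2
```

Both quantities are of the order of $10^{-8}$ and nearly equal, as they must be for a nearly circular orbit, for which $v^2 = U$.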

In view of this, the problem of justifying the gravitational
equations to a first approximation reduces to that of proving:

(a) that one of these equations (the one corresponding to
$i = k = 0$) involves only $\gamma$ (i.e. $U$) and is identical with
Poisson’s equation;

(b) that the other nine are consistent with values of the
functions $\gamma$ of the assumed order of magnitude: their precise
values in this first approximation are a matter of complete
indifference, since whatever they may be we in any case arrive
back at the Newtonian formulae.

We can therefore limit the scope of (a) and (b) to the statical
case, for the reasons indicated at the end of § 1.

The passage to a furtlicr approximation in the equations of 
motion of a material particle involves (cf. Chapter XI, p. 320) 
either the values to a first approximation of y^ and 
third-order correction 0 in the expression for It is this 

difference from the results of the Newtonian laws which, being 
within the range of astronomical observation, provides a means 
of testing whether Einstein’s hypothesis is or is not superior to 
its classical predecessor. 

At this point we are, so to speak, in conditions analogous 
to those in which Newton found himself when he substituted 
for Kepler’s kinematical laws the dynamical principle of universal 
attraction, which was capable not only of including Kepler’s laws 
as a first approximation, but also of predicting, and that on a 
magnificent scale, new facts which have since found marvellous 
confirmation. When the relativity theory is substituted for the 
Newtonian, the phenomena predicted by it are much more 
minute, but even with present experimental resources, some at 
least of them are within the reach of experiment. This experimental 
control provides the second line of argument alluded 
to above in support of the gravitational equations. 


4. General equations of Einsteinian statics. Empty space. 

When we are dealing with statical phenomena (cf. Chapter 
XI, p. 326), the $ds^2$ of space-time has the form 

$$ds^2 = V^2\, dx_0^2 - dl^2, \qquad (10)$$

$$dl^2 = \sum_{i,k=1}^{3} a_{ik}\, dx_i\, dx_k. \qquad (10')$$

The coefficients $a_{ik}$, like $V$, are to be functions of $x_1$, $x_2$, $x_3$ 
only; $V$ is interpreted (cf. Chapter XI, p. 339) as the velocity of 
light, and is therefore considered essentially positive. 

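With obvious meanings for the symbols, we have 

$$g_{ik} = -a_{ik}, \quad g_{0i} = 0, \quad g_{00} = V^2; \qquad g^{ik} = -a^{ik}, \quad g^{0i} = 0, \quad g^{00} = \frac{1}{V^2}; \qquad g = -a V^2. \qquad (11)$$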



We shall use a dash (') to denote Christoffel’s symbols and 
the components of the Riemann-Christoffel and Einstein tensors 
relative to the quaternary form (10), and shall keep the ordinary 
notation without a dash for the analogous symbols and components 
relative to (10'). 

From the definitions and (11) we get 

$$\{ik, l\}' = \{ik, l\},$$

$$\{ik, 0\}' = 0, \qquad \{i0, 0\}' = \frac{V_i}{V}, \qquad \{00, i\}' = V\, V^i, \qquad \{0i, k\}' = 0, \qquad \{00, 0\}' = 0, \qquad (12)$$

where $i$, $k$, $l$ can take any of the values 1, 2, 3, $V_i = \dfrac{\partial V}{\partial x_i}$, and 
$V^i = \sum_{k=1}^{3} a^{ik} V_k$ is the reciprocal system with respect to the 
purely spatial $dl^2$. 

We shall next express Riemann’s symbols of the second kind 
for the quaternary $ds^2$ in terms of the analogous symbols for 
$dl^2$ and of $V$. We have by definition, from formula (3) of Chapter 
VII, p. 176, 

$$\{ir, hk\}' = \frac{\partial}{\partial x_k}\{ih, r\}' - \frac{\partial}{\partial x_h}\{ik, r\}' - \sum_{l=0}^{3}\left[\{lh, r\}'\,\{ik, l\}' - \{lk, r\}'\,\{ih, l\}'\right].$$


We shall examine separately the various cases which may 
occur, according to the number of the indices $i$, $r$, $h$, $k$ which are 
zero. 

(1) No index zero. The first group of (12) gives immediately 

$$\{ir, hk\}' = \{ir, hk\}. \qquad (13)$$

(2) A single index zero. Riemann’s symbols being antisymmetrical 
with respect to the last two indices, we need only 
examine the three cases in which the zero index is $i$, $r$, or $h$. In 
each case, from the second group of the formulae (12) it follows 
immediately that Riemann’s symbols of this type are all zero, or 

$$\{0r, hk\}' = \{i0, hk\}' = \{ir, 0k\}' = 0. \qquad (14)$$

(3) Two indices zero. From the general properties of the 
Riemann-Christoffel tensor the symbols of the type $\{ir, 00\}'$ 
vanish identically (for any $ds^2$), and those of the type 

$$\{00, hk\}' = \sum_{j=0}^{3} g^{0j}\,(0j, hk)'$$

vanish whenever $g^{0j} = 0$ (for $j > 0$), as in our case. There 
remain therefore to be considered the two types $\{0r, 0k\}'$ and 
$\{i0, 0k\}'$. 

From (12) and the fundamental formula of covariant differentiation 
with respect to the purely spatial $dl^2$ we find 

$$\{0r, 0k\}' = V\,(V^r)_k, \qquad \{i0, 0k\}' = \frac{V_{ik}}{V}. \qquad (14')$$

(4) Three or four indices zero. It will be seen immediately 
from (12) that these symbols are all zero. 

We are now in a position to evaluate explicitly the symmetrical 
double tensor $G'_{ik}$, the elements of which, as we know 
(§ 2), are 

$$G'_{ik} = \sum_{h=0}^{3}\{ih, hk\}' = \sum_{h=1}^{3}\{ih, hk\}' + \{i0, 0k\}'.$$

Introducing the analogous system 

$$G_{ik} = \sum_{h=1}^{3}\{ih, hk\}$$

relative to the ternary form $dl^2$, we find at once, using the 
expressions obtained for the symbols $\{ir, hk\}'$, 

$$G'_{ik} = G_{ik} + \frac{V_{ik}}{V}, \qquad G'_{0k} = 0, \qquad G'_{00} = -V\,\Delta_2 V. \qquad (15)$$


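From these formulae and (11) we get for the linear invariant 
of the system $G'_{ik}$, 

$$G' = \sum_{i,k=0}^{3} g^{ik}\, G'_{ik} = \frac{1}{V^2}\, G'_{00} - \sum_{i,k=1}^{3} a^{ik}\, G'_{ik} = -G - \frac{2\,\Delta_2 V}{V}. \qquad (16)$$

We have already seen (Chapter VII, p. 200) that for a three-dimensional 
manifold we can with advantage replace the tensor 
$G_{ik}$ by Ricci’s tensor $\alpha_{ik}$, the linear invariant 

$$\mathcal{M} = \sum_{i,k=1}^{3} a^{ik}\, \alpha_{ik}$$

of which (cf. Chapter VII, p. 203) represents the mean curvature 
(the sum of the three principal curvatures). 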

The $G_{ik}$’s and $\alpha_{ik}$’s are connected by the linear relations 

$$G_{ik} = \alpha_{ik} - \mathcal{M}\, a_{ik};$$

from this, multiplying by $a^{ik}$ and summing with respect to $i$, $k$, 
there follows in particular 

$$G = \mathcal{M} - 3\mathcal{M} = -2\mathcal{M}.$$

Applying these results, (15) and (16) become 

$$G'_{ik} = \alpha_{ik} - \mathcal{M}\, a_{ik} + \frac{V_{ik}}{V}, \qquad G'_{0i} = 0, \qquad G'_{00} = -V\,\Delta_2 V \qquad (i, k = 1, 2, 3), \qquad (15')$$

$$G' = 2\mathcal{M} - \frac{2\,\Delta_2 V}{V}, \qquad (16')$$

which provide convenient expressions for the components of the 
Einstein tensor and its linear invariant in statical conditions. 

We can now return to the gravitational equations (8) of the 
preceding section. We note in the first place that since in statical 
conditions there is no energy flux, the components $T_{0i}$ vanish. 
Hence from (11) and (15') three of these equations reduce to 
pure identities, and there remain seven; six of these, corresponding 
to non-zero values of the indices, have the form 

$$\alpha_{ik} + \frac{V_{ik}}{V} - \frac{\Delta_2 V}{V}\, a_{ik} = -\kappa\, T_{ik} \qquad (i, k = 1, 2, 3) \qquad (17)$$

in virtue of (15'), (16'), and (11), while the seventh, for $i = k = 0$, 
is 

$$-V\,\Delta_2 V - \tfrac{1}{2}\, G'\, g_{00} = -\kappa\, T_{00}$$

or, from (11) and (16'), 

$$\mathcal{M} = \kappa\, \frac{T_{00}}{V^2}. \qquad (18)$$

These seven equations¹ (17) and (18), as is naturally to be 
expected, reduce the Einsteinian statics to the three dimensions 
of the associated space. Their form is invariant with respect to 
the metric of this space, which has the $dl^2$ in question as its 
fundamental quadratic form. They also involve, in association 
with the fundamental form, the two invariant functions $V$ and 
$\dfrac{T_{00}}{V^2}$, and the covariant double system $T_{ik}$ ($i, k = 1, 2, 3$). The 
latter characterizes the distribution of the stresses, while $\dfrac{T_{00}}{V^2}$ 
is to be interpreted as the energy density (cf. Chapter XI, p. 357), 
$V$ representing the velocity of light, as was said at the outset. 

¹ Cf. Levi-Civita: Rend. della R. Acc. dei Lincei, Vol. XXVI (first half-year, 
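The passage from the static components (15') and (16') to the left-hand sides of (17) and (18) is pure algebra, and it can be checked on arbitrary numbers. The following is only a sketch under stated assumptions: the matrices `A`, `ALPHA`, `HESS_V` and the value `V` are made-up sample data standing for $a_{ik}$, $\alpha_{ik}$, $V_{ik}$ and $V$ at a single point, and the helper names are illustrative, not taken from the text.

```python
from fractions import Fraction

def inverse3(a):
    """Inverse of a 3x3 matrix via cofactors (the reciprocal system a^{ik})."""
    (a00, a01, a02), (a10, a11, a12), (a20, a21, a22) = a
    adj = [
        [a11 * a22 - a12 * a21, a02 * a21 - a01 * a22, a01 * a12 - a02 * a11],
        [a12 * a20 - a10 * a22, a00 * a22 - a02 * a20, a02 * a10 - a00 * a12],
        [a10 * a21 - a11 * a20, a01 * a20 - a00 * a21, a00 * a11 - a01 * a10],
    ]
    det = a00 * adj[0][0] + a01 * adj[1][0] + a02 * adj[2][0]
    return [[Fraction(adj[i][k]) / Fraction(det) for k in range(3)] for i in range(3)]

def contract(a_inv, t):
    """Linear invariant sum_{i,k} a^{ik} t_{ik}."""
    return sum(a_inv[i][k] * t[i][k] for i in range(3) for k in range(3))

def static_reduction(a, alpha, hess_v, v):
    """Build G'_ik - (1/2) G' g_ik from (15'), (16') with g_ik = -a_ik, g_00 = V^2."""
    a_inv = inverse3(a)
    m = contract(a_inv, alpha)        # mean curvature M
    lap_v = contract(a_inv, hess_v)   # Delta_2 V
    ein_ik = [[alpha[i][k] - m * a[i][k] + hess_v[i][k] / v for k in range(3)]
              for i in range(3)]      # spatial part of (15')
    ein_00 = -v * lap_v               # time part of (15')
    ein_tr = 2 * m - 2 * lap_v / v    # invariant (16')
    lhs_ik = [[ein_ik[i][k] + ein_tr * a[i][k] / 2 for k in range(3)] for i in range(3)]
    lhs_00 = ein_00 - ein_tr * v * v / 2
    return m, lap_v, lhs_ik, lhs_00

# Hypothetical sample data (exact rationals, so the comparison is exact).
A = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]        # positive-definite a_ik
ALPHA = [[1, 2, 0], [2, -1, 3], [0, 3, 5]]   # symmetric alpha_ik
HESS_V = [[3, 0, 1], [0, 2, -1], [1, -1, 4]] # symmetric V_ik
V = Fraction(3, 2)

m, lap_v, lhs_ik, lhs_00 = static_reduction(A, ALPHA, HESS_V, V)
print(lhs_00 == -m * V * V)  # left-hand side of the (0,0) equation equals -M V^2
```

With these conventions the six spatial components reproduce the left-hand side of (17), and the remaining component gives $-\mathcal{M} V^2$, which is (18) after division by $-V^2$.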
With regard to the energy density it is to be observed that 
no example of a negative density exists,¹ at least within the 
range of the better-known phenomena to-day, whether material, 
or electromagnetic in the broad sense. Hence we may assume 
that the right-hand side of (18) is $\geq 0$, and we get the following 
geometrical corollary: The mean curvature $\mathcal{M}$ determined in 
physical space as the effect of purely statical phenomena, is in every 
case either positive or zero. 

¹ In fact, if at a given point there is matter at rest distributed with density 
$\rho$, this implies an energy of material origin, which in normal conditions 
enormously outweighs all other possible contributions to the total. Moreover, 
the electromagnetic contribution to the energy density is also $\geq 0$. Hence even 
when there is no matter it does not seem possible for the energy density to have 
a negative value. 

An important consequence of the equations (17) is obtained 
on multiplying them by $a^{ik}$ and summing with respect to the 
two indices. Using the definition of $\mathcal{M}$ and (18) we get 

$$\frac{\Delta_2 V}{V} = \tfrac{1}{2}\,\kappa\left(T + \frac{T_{00}}{V^2}\right), \qquad (19)$$

where 

$$T = \sum_{i,k=1}^{3} a^{ik}\, T_{ik} \qquad (20)$$

and obviously represents the linear invariant of the system of 
stresses with respect to our $dl^2$ (of the associated space). It may 
be remarked incidentally that this invariant must not be confused 
with the scalar invariant of the four-dimensional tensor, 
namely, 

$$\mathcal{T} = \sum_{i,k=0}^{3} g^{ik}\, T_{ik},$$

the value of which, from (11), is on the contrary 

$$\mathcal{T} = \frac{T_{00}}{V^2} - T.$$

Consider in particular a region of space in which all the 
components of the energy tensor vanish (empty space). From 
the physical point of view, this condition can be considered 
satisfied when the region in question contains neither ordinary 
matter nor electromagnetic energy, since in this case it follows 
from the mechanics of material media that the stresses of material 
origin vanish, and, from Maxwell’s theory, that the Maxwellian 
field of force vanishes, and therefore also the Maxwellian stress.¹ 

With this hypothesis the equations (17), in view of (19), 
plainly reduce to the form 

$$\Delta_2 V = 0, \qquad (21)$$

$$\alpha_{ik} + \frac{V_{ik}}{V} = 0 \qquad (i, k = 1, 2, 3), \qquad (22)$$

the first of which shows that not the timelike coefficient $g_{00} = V^2$ 
itself, but its square root, is a harmonic function. Also, (18) 
gives at once 

$$\mathcal{M} = 0. \qquad (21')$$

If the energy tensor were zero throughout all space, it is 
intuitive from the physical standpoint that the Einsteinian $ds^2$ 
would be rigorously pseudo-Euclidean, and therefore the associated 
space rigorously Euclidean. This in fact represents the starting-point 
of Einstein’s speculative construction, which assigns any 
deviation from a pseudo-Euclidean metric to those physical 
actions which are included in the energy tensor. Serini² too has 
given a rigorous proof of the hypothesis, based on equations 
(21) and (22). 

5. First approximation. Connexion with Poisson’s equation.³ 

If we suppose that the expression (10) for $ds^2$ differs by very 
little from the Euclidean type referred to Cartesian space co-ordinates 
and Römerian time 

$$ds^2 = dy_0^2 - \sum_{i=1}^{3} dy_i^2,$$

we can put (cf. § 3, and Chapter XI, § 10, p. 320) 

$$V = 1 - \gamma, \qquad (23)$$

$$a_{ik} = \delta_i^k + \gamma_{ik} \qquad (i, k = 1, 2, 3). \qquad (24)$$

We thus have 

$$dl^2 = \sum_{i,k=1}^{3} a_{ik}\, dy_i\, dy_k = dl_0^2 + \sum_{i,k=1}^{3} \gamma_{ik}\, dy_i\, dy_k, \qquad (24')$$

where $dl_0^2$ is the line element of ordinary Euclidean space referred 
to Cartesian co-ordinates. 

¹ See e.g. Jeans: The Mathematical Theory of Electricity and Magnetism, fifth 
edition, 1925, Chap. VI, Cambridge University Press. 

² Rend. della R. Acc. dei Lincei, Vol. XXVII (first half-year, 1918), p. 285. 

³ Levi-Civita, loco cit., and ibidem (second half-year, 1917), pp. 307-317. 

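The quantities $\gamma_{ik}$ are pure numbers, like $\gamma$, and the qualitative 
property we have assigned to $ds^2$ is equivalent, to a first approximation, 
to treating all these seven quantities as infinitesimals. 

It follows that Christoffel’s symbols 

$$[ih, j] = \frac{1}{2}\left(\frac{\partial \gamma_{ij}}{\partial y_h} + \frac{\partial \gamma_{hj}}{\partial y_i} - \frac{\partial \gamma_{ih}}{\partial y_j}\right)$$

are also infinitesimal. Since to the same order of approximation 
the quantities $a^{ik}$ keep their Euclidean values $\delta_i^k$, it will be seen 
that the symbols of the second kind 

$$\{ih, r\} = \sum_{j=1}^{3} a^{rj}\,[ih, j]$$

do not differ appreciably from the homologous symbols $[ih, r]$ 
of the first kind. It follows that from the definition of Riemann’s 
symbols (p. 175, formula (3)) we have, neglecting terms of 
higher order, 

$$\{ir, hk\} = \frac{1}{2}\left(\frac{\partial^2 \gamma_{hr}}{\partial y_i\,\partial y_k} + \frac{\partial^2 \gamma_{ik}}{\partial y_r\,\partial y_h} - \frac{\partial^2 \gamma_{ih}}{\partial y_r\,\partial y_k} - \frac{\partial^2 \gamma_{kr}}{\partial y_i\,\partial y_h}\right).$$

Hence it follows, from the definition of the $G_{ik}$’s (cf. § 4) and 
from (24), that 

$$G_{ik} = \sum_{h=1}^{3}\{ih, hk\} = \frac{1}{2}\sum_{h=1}^{3}\left[\frac{\partial^2 \gamma_{ik}}{\partial y_h^2} + \frac{\partial^2 \gamma_{hh}}{\partial y_i\,\partial y_k} - \frac{\partial^2 \gamma_{ih}}{\partial y_h\,\partial y_k} - \frac{\partial^2 \gamma_{hk}}{\partial y_i\,\partial y_h}\right].$$

We now return to the statical equations (17) and (18). We 
have already made them contain Ricci’s symbols $\alpha_{ik}$ instead of 
the $G_{ik}$’s, the relation between the two being 

$$G_{ik} = \alpha_{ik} - \mathcal{M}\, a_{ik}.$$

It is to be noted that $\mathcal{M}$ is now to be considered infinitesimal, 
like the $G_{ik}$’s and their linear invariant, so that, from (18), $\kappa T_{00}$ 
is also infinitesimal. Replacing $\mathcal{M}$ by its value (18), the explicit 
expressions which represent the $\alpha_{ik}$’s to a first approximation 
take the form 

$$\alpha_{ik} = \frac{1}{2}\sum_{h=1}^{3}\left[\frac{\partial^2 \gamma_{ik}}{\partial y_h^2} + \frac{\partial^2 \gamma_{hh}}{\partial y_i\,\partial y_k} - \frac{\partial^2 \gamma_{ih}}{\partial y_h\,\partial y_k} - \frac{\partial^2 \gamma_{hk}}{\partial y_i\,\partial y_h}\right] + \kappa\, T_{00}\,\delta_i^k \qquad (i, k = 1, 2, 3). \qquad (25)$$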

Using this result, and noting further that, neglecting infinitesimals 
of higher order, the covariant derivatives of $V = 1 - \gamma$ 
do not differ from the ordinary derivatives, so that in particular 

$$\Delta_2 V = -\Delta_2^0\, \gamma = -\sum_{i=1}^{3} \frac{\partial^2 \gamma}{\partial y_i^2},$$

we find that (17) and (19) can be written as 

$$\alpha_{ik} - (\gamma)_{ik} + \Delta_2^0 \gamma\; \delta_i^k = -\kappa\, T_{ik}, \qquad (26)$$

$$\Delta_2^0\, \gamma = -\tfrac{1}{2}\,\kappa\,(T + T_{00}), \qquad (27)$$

where the symbols $(\gamma)_{ik}$ denote covariant derivatives of $\gamma$; 
and in particular, in empty space, since the terms on the right 
vanish, they become 

$$\alpha_{ik} = (\gamma)_{ik}, \qquad (26')$$

$$\Delta_2^0\, \gamma = 0, \qquad (27')$$

which to a first approximation, as is naturally to be expected, 
are identical with (22) and (21). 

At this point we must consider the mechanical significance 
of the function $\gamma$, or, better, of the product $c^2\gamma$. On p. 293, Chap. 
XI, when dealing with Einstein’s modification of Hamilton’s 
principle, we saw that, when $ds^2$ is very close to the pseudo-Euclidean 
form, the difference between $g_{00}$ and unity is $-\dfrac{2U}{c^2}$ 
to a first approximation, $U$ being the potential of the field of 
force in which the motion takes place. In the present case this 
difference is $-2\gamma$, so that we have 

$$\gamma = \frac{U}{c^2}.$$

This conclusion could of course also have been deduced 
from the general proposition in § 12 of Chapter XI, p. 328, that 
$-\tfrac{1}{2} c^2 V^2$ (together with a non-essential additive constant) constitutes 
the potential function of the force exerted in the field 
in statical conditions. In our case $V^2 = 1 - 2\gamma$, and therefore 

$$-\tfrac{1}{2} c^2 V^2 = -\tfrac{1}{2} c^2 (1 - 2\gamma) = -\frac{c^2}{2} + c^2 \gamma,$$

which proves the required result. 

Now let us for the moment again take the standpoint of the 
classical mechanics, and consider the field of force due to a generic 
distribution of matter of density $\rho = \dfrac{\varepsilon}{c^2}$, where $\varepsilon$ is the corresponding 
energy density. If $U$ is the Newtonian potential of 
this field, we know that Poisson’s equation 

$$\Delta_2^0\, U = -4\pi f \rho$$

holds, $f$ being the coefficient of universal attraction. If on the 
other hand we take the standpoint of general relativity, the 
same distribution of matter gives a $ds^2$ for which $\gamma = \dfrac{U}{c^2}$, and 
an energy tensor whose component $T_{00}$ coincides with $\varepsilon$, while 
the components $T_{0i}$ vanish in statical conditions, so that the 
remaining components $T_{ik}$ represent stresses (cf. Chapter XI, 
p. 368). If we are dealing with discrete matter, the components 
$T_{ik}$, and therefore also their invariant $T$, are zero, and (27) 
becomes 

$$\Delta_2^0\, U = -\tfrac{1}{2}\,\kappa\, c^2\, \varepsilon = -\tfrac{1}{2}\,\kappa\, c^4 \rho.$$

In order that this may be identical with Poisson’s equation, 
it is necessary and sufficient that the constant $\kappa$ of the gravitational 
equations and the universal constants $f = 6.7 \times 10^{-8}$ 
and $c = 3 \times 10^{10}$ (in C.G.S. units) of the classical mechanics 
should be connected by the relation 

$$\kappa = \frac{8\pi f}{c^4}, \qquad (28)$$

which gives in round numbers (C.G.S. units) 

$$\kappa = 2 \times 10^{-48}.$$
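The round number just quoted can be recovered directly from relation (28). The short sketch below does nothing more than evaluate $\kappa = 8\pi f / c^4$ with the two C.G.S. values given in the text; the function and constant names are illustrative choices, not notation from the source.

```python
import math

F_CGS = 6.7e-8   # coefficient of universal attraction f, C.G.S. units (value quoted in the text)
C_CGS = 3.0e10   # velocity of light c in cm/s (value quoted in the text)

def kappa(f, c):
    """Constant of the gravitational equations from relation (28): 8*pi*f / c**4."""
    return 8.0 * math.pi * f / c ** 4

print(f"kappa = {kappa(F_CGS, C_CGS):.2e}")  # prints kappa = 2.08e-48
```

The result agrees with the rounded value $2 \times 10^{-48}$ to the one significant figure retained in the text.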

For the remainder of the argument we shall adopt this value 
of $\kappa$, and shall definitely take up the standpoint of relativity. 
In relation to the remarks in § 3 we can at this point consider that 
the preliminary justification of the gravitational equations is 
terminated. In fact, their first approximation is represented in 
statical conditions by (26) and (27). The equation (27), as we 
have now proved, is identical with Poisson’s equation; the 
equations (26), as we shall see in the following section, serve to 
determine the quantities $\gamma_{ik}$, which to a first approximation, as 
we have already said, do not influence the motion, but will 
become essential when we come to discriminate on a more refined 
scale between the Newtonian mechanics and the relativity theory. 
Here we have referred specifically to the statical case, but the 
justification of the gravitational equations obtained in this case 
also holds good, as already pointed out in § 3, in the general 
case, provided the coefficients $\gamma_i$ of the product terms in $dx_0\, dx_i$ 
($i = 1, 2, 3$) are of order higher than the first. We have arrived 
at this condition by a process of induction from experimental 
facts, and have used it to reduce the ten gravitational equations 
to the seven of (17) and (18). We are now so to speak at the 
deductive stage, and must first show that the gravitational 
equations contain in synthesis all the facts to a first approximation; 
and at this stage we must point out that in ordinary 
conditions of material motion (i.e. with velocities which are small 
compared with that of light) the three gravitational equations 

$$G_{0i} - \tfrac{1}{2}\, G\, g_{0i} = -\kappa\, T_{0i} \qquad (i = 1, 2, 3), \qquad (29)$$

which are rigorously true in statical conditions, continue to hold 
to a first approximation if we suppose the quantities $\gamma_i$ of order 
higher than the second (that of $\gamma$ and of the $\gamma_{ik}$’s). In fact, the 
left-hand side of these equations, as we have already seen (cf. § 4), 
becomes identically zero when we put $g_{0i} = 0$: this means that 
if the three $g_{0i}$ are treated as quantities $\gamma_i$ of a certain order 
of smallness, the left-hand side of (29) will be of at least the same 
order.¹ If therefore we suppose that the $\gamma_i$’s are of order higher 
than the second, the left-hand side of (29) will also be of the 
same order, and therefore zero to a first approximation. As 
regards the right-hand side, we know (cf. Chapter XI, p. 356) that 
in a pseudo-Euclidean metric, and therefore (neglecting terms 
of higher order) also in the case we are considering, 

$$T_{0i} = -T_{00}\,\beta_i,$$

and hence, from the presence of the factor $\beta_i$, it follows that $T_{0i}$ 
is of higher order of smallness than $T_{00}$, and therefore that the 
right-hand side of (29) is of higher order than $-\kappa T_{00}$ and is 
therefore zero to a first approximation. Hence, in these conditions, 
the equation (29) is satisfied. 


¹ The quickest way of showing this is to suppose that the $g_{0i}$ are of the form 
$\lambda \gamma_i^*$, where $\lambda$ is a numerical coefficient determining the order of magnitude, and 
the $\gamma_i^*$’s are functions of position and of the time, to be treated as finite quantities 
together with their first derivatives. It is clear in this case that the left-hand 
side of (29) contains $\lambda$ as a factor. 


6. The Einsteinian $ds^2$ which corresponds to a first approximation 
to an assigned Newtonian field. 


Suppose a Newtonian field and its potential $U$ given. From 
the remark made in § 1, we can ignore the possibility (consequent 
on the motion of the material masses) that $U$ may depend 
explicitly on the time, and treat $U$ only as a function of the space 
co-ordinates, as if the masses were at rest in the positions they 
occupy at the instant considered. Consider a region not occupied 
by attracting masses, in which region $\Delta_2^0\, U = 0$. In order to 
characterize the corresponding Einsteinian $ds^2$ to a first approximation 
we have to determine (cf. § 5) the functions $\gamma$ and $\gamma_{ik}$, 
where $\gamma$ is given by $\gamma = \dfrac{U}{c^2}$ and is therefore harmonic (i.e. a 
solution of (27')), and the $\gamma_{ik}$’s have to satisfy (26'), which can 
also be written in the simpler form 

$$\alpha_{ik} = \frac{\partial^2 \gamma}{\partial y_i\,\partial y_k}, \qquad (26'')$$

since the covariant derivatives which would occur on the right 
would differ from the ordinary derivatives by terms of higher order, 
and can therefore be replaced by these ordinary derivatives. 

For the integration of these equations we note in the first 
place that we get a particular solution by taking 

$$\gamma_{ik} = 2\,\delta_i^k\,\gamma. \qquad (30)$$

The proof follows immediately from the expression (25) for 
the $\alpha_{ik}$’s, in which $T_{00}$ is of course put equal to zero. Substituting 
for $\alpha_{ik}$ in (26'') the values 

$$\alpha_{ik} = \frac{1}{2}\sum_{h=1}^{3}\left[\frac{\partial^2 \gamma_{ik}}{\partial y_h^2} + \frac{\partial^2 \gamma_{hh}}{\partial y_i\,\partial y_k} - \frac{\partial^2 \gamma_{ih}}{\partial y_h\,\partial y_k} - \frac{\partial^2 \gamma_{hk}}{\partial y_i\,\partial y_h}\right] \qquad (29')$$

and remembering that $\gamma$ is harmonic, the required result follows. 
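The substitution described above can also be tried numerically. The sketch below is an illustration under stated assumptions, not part of the original argument: it takes the harmonic function $\gamma = 1/r$ (a point mass of unit strength, chosen only as an example), forms $\gamma_{ik} = 2\,\delta_i^k\,\gamma$ as in (30), evaluates the linearised $\alpha_{ik}$ of (29') by central differences, and compares the result with the second derivatives of $\gamma$ required by (26''). The step size and the sample point are arbitrary choices.

```python
import math

def gamma(y):
    """Harmonic test potential gamma = 1/r (unit strength, an assumed example)."""
    return 1.0 / math.sqrt(y[0] ** 2 + y[1] ** 2 + y[2] ** 2)

def gamma_ik(i, k, y):
    """Particular solution (30): gamma_ik = 2 * delta_ik * gamma."""
    return 2.0 * gamma(y) if i == k else 0.0

def second_derivative(func, a, b, y, step=1e-3):
    """Central-difference estimate of d^2 func / (dy_a dy_b) at the point y."""
    def at(shift_a, shift_b):
        point = list(y)
        point[a] += shift_a
        point[b] += shift_b
        return func(point)
    if a == b:
        return (at(step, 0.0) - 2.0 * at(0.0, 0.0) + at(-step, 0.0)) / step ** 2
    return (at(step, step) - at(step, -step)
            - at(-step, step) + at(-step, -step)) / (4.0 * step ** 2)

def alpha(i, k, y):
    """Linearised Ricci symbols alpha_ik of formula (29')."""
    total = 0.0
    for h in range(3):
        total += second_derivative(lambda p: gamma_ik(i, k, p), h, h, y)
        total += second_derivative(lambda p: gamma_ik(h, h, p), i, k, y)
        total -= second_derivative(lambda p: gamma_ik(i, h, p), h, k, y)
        total -= second_derivative(lambda p: gamma_ik(h, k, p), i, h, y)
    return 0.5 * total

def max_residual(y):
    """Largest |alpha_ik - d^2 gamma / (dy_i dy_k)|, i.e. the defect in (26'')."""
    return max(abs(alpha(i, k, y) - second_derivative(gamma, i, k, y))
               for i in range(3) for k in range(3))

print(max_residual([1.0, 0.7, -0.4]) < 1e-4)
```

The only term that does not cancel identically is the discrete Laplacian of $\gamma$ on the diagonal, which is small because $\gamma$ is harmonic; this is the same cancellation the text appeals to.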

Since then the equations (26'') constitute a linear non-homogeneous 
system in the $\gamma_{ik}$’s, the general integral is obtained by 
adding the solution (30) to the most general solution of the 
equations with the right-hand side zero, i.e. $\alpha_{ik} = 0$. 

The general integral of this system could easily be constructed 
by using the result (cf. Chapter VII, p. 200) that for a three-dimensional 
manifold the vanishing of Ricci’s symbols $\alpha_{ik}$ implies 
that all Riemann’s symbols are likewise zero, or in other words 
that the quantities 

$$\delta_i^k + \gamma_{ik}$$

are the coefficients of a Euclidean $dl^2$ (referred to any curvilinear 
co-ordinates whatever). But, as it happens, the addition to the 
particular solution (30) of the general integral of the homogeneous 
system has no interest, since, as we shall see shortly, this corresponds 
merely to a change of the co-ordinates of reference. 

In fact, the vanishing of the symbols $\alpha_{ik}$, as we have just 
pointed out, expresses the necessary and sufficient condition that 
$dl^2$ should be Euclidean, i.e. reducible, with a suitable choice of 
parameters, to the form $\sum_{i=1}^{3} dy_i^2$. Hence, if $x_1$, $x_2$, $x_3$ denote the 
co-ordinates of reference in their most general form, the most 
general method of defining a Euclidean $dl^2$, with respect to these 
co-ordinates $x$, will evidently be to introduce a transformation 
of any kind 

$$y_i = y_i(x_1, x_2, x_3) \qquad (i = 1, 2, 3)$$

between the $y$’s and the $x$’s, and to take for the coefficients $a_{ik}$ 
those which result from expressing $\sum_{i=1}^{3} dy_i^2$ in terms of the differentials 
of the $x$’s. 

Assuming the functions $y_i(x_1, x_2, x_3)$ in the form 

$$y_i = x_i + \xi_i(x_1, x_2, x_3),$$

as is always legitimate, and inserting the corresponding differentials 
in the trinomial $\sum_{i=1}^{3} dy_i^2$, we get 

$$dl^2 = \sum_{i,k=1}^{3} a_{ik}\, dx_i\, dx_k,$$

where 

$$a_{ik} = \delta_i^k + \frac{\partial \xi_i}{\partial x_k} + \frac{\partial \xi_k}{\partial x_i} + \sum_{j=1}^{3} \frac{\partial \xi_j}{\partial x_i}\,\frac{\partial \xi_j}{\partial x_k}.$$

In order to take account of the condition that the difference 
$a_{ik} - \delta_i^k = \gamma_{ik}$ is limited to the first order, together with the 
further condition that the difference between the Cartesian 
co-ordinate system of the $y$’s and the curvilinear system of the 
$x$’s is to be of the same order,¹ it is sufficient (and necessary) that 
we should be able to treat the functions $\xi$ and their derivatives 
as infinitesimals. It follows that 

$$\gamma_{ik} = \frac{\partial \xi_i}{\partial x_k} + \frac{\partial \xi_k}{\partial x_i}, \qquad (31)$$

which constitutes the formal expression for the general integral 
of the homogeneous system $\alpha_{ik} = 0$ (the $\alpha_{ik}$’s, in the form (29'), 
being linearly dependent on the $\gamma_{ik}$’s). 


¹ If this condition is not imposed, the only necessary condition is that the six 
numerical quantities 

$$\gamma_{ik} = \frac{\partial \xi_i}{\partial x_k} + \frac{\partial \xi_k}{\partial x_i} + \sum_{j=1}^{3} \frac{\partial \xi_j}{\partial x_i}\,\frac{\partial \xi_j}{\partial x_k}$$

should be infinitesimal, and this can be secured, as Prof. Almansi has shown 
(cf. “L’ordinaria teoria dell’elasticità e la teoria delle deformazioni finite”, in 
Rend. della R. Acc. dei Lincei, Vol. XXVI (second half-year, 1917), pp. 3-8), 
even when the quantities $\xi$ are not themselves infinitesimal. 

But it is not this formal expression with which we are concerned, 
but rather the circumstance that the term (31), which is 
to be added to (30) in order to get the general integral of the 
system (26'') with the right-hand side not zero, can always be 
made equal to zero by a suitable change of co-ordinates; this 
change being the substitution for the $x$’s of the combinations 

$$y_i = x_i + \xi_i(x_1, x_2, x_3), \qquad (32)$$

the result of which is that the expression for $dl^2$ reduces, by 
construction, to $\sum_{i=1}^{3} dy_i^2$, all the differences $a_{ik} - \delta_i^k$ vanishing. 

When the $y$’s are chosen as variables, the transformation (32) 
must naturally be applied also to the particular solution (30). 
But since the $\xi$’s are to be considered infinitesimal equally with 
$\gamma$, (32) reduces, so far as (30) is concerned, to the mere substitution 
of the $y$’s for the $x$’s. The expression (30) for the particular 
solution, which alone is of any interest for our purpose, thus 
remains unaltered when the system of reference is changed to 
the $y$’s. 

It is further to be noted that the elementary form $\Delta_2^0\, \gamma$ (the 
sum of the second derivatives) of the parameter $\Delta_2 \gamma$ also remains 
unaltered. 

From the foregoing arguments we see that in an empty field 
the statical potential $U$ (Newtonian to a first approximation) is 
associated with a metric modification of the associated three-dimensional 
space. With a suitable choice of the co-ordinates of reference 
(the $y$’s just defined) we have 

$$\gamma = \frac{U}{c^2},$$

with $\gamma$ harmonic (in the $y$’s as well as in the $x$’s); the values of 
the coefficients $a_{ik}$ of the square of the line element are given 
(to the same degree of approximation) by the expression 
$\delta_i^k (1 + 2\gamma)$, so that 

$$dl^2 = (1 + 2\gamma)\,(dx_1^2 + dx_2^2 + dx_3^2).$$

It will be seen that in general the space does not remain 
Euclidean even to a first approximation, but, to this degree of 
approximation, can only be conformally represented in a 
Euclidean space. 

To sum up, remembering that $g_{00} = V^2$ and that $g_{0i} = 0$, 
the $ds^2$ of the Einsteinian space-time belonging to an assigned 
Newtonian field of force with potential $U = c^2 \gamma$ is given by 

$$ds^2 = (1 - 2\gamma)\, dx_0^2 - (1 + 2\gamma)\, dl_0^2, \qquad (33)$$

where $dl_0^2$ is the line element of a Euclidean space. 

In the case of a single point-mass we need of course only 
take 

$$\gamma = \frac{f m_0}{c^2}\,\frac{1}{r},$$

where $r$ represents the distance between the mass and the point 
at which the attraction acts. 


7. Further approximation for the coefficient $g_{00}$ in 
statical conditions. 


In the preceding chapter (p. 320) we saw that if the $ds^2$ of space-time 
is not far removed from being pseudo-Euclidean, then the 
motion of a material particle is affected, to a first approximation, 
only by the second-order difference $2\gamma = \dfrac{2U}{c^2}$ between $g_{00}$ and 
unity, so that the results are the same as for the Newtonian theory. 
If, however, we wish to proceed to a further approximation, i.e. 
to calculate the principal part of the Einsteinian correction to be 
applied to the laws of the classical mechanics, we must not only 
find the second-order quantities $\gamma_{ik}$ which are the differences 
between the $a_{ik}$’s and the Euclidean values $\delta_i^k$ (the $\gamma_i$’s being of 
higher order), but we shall also need an evaluation of $g_{00} = V^2$ 
carried to the fourth order. 

This is easily found if we limit the investigation to the statical 
case and to a portion of empty space (with the energy tensor 
zero). The differential equation (21) of § 4 is then rigorously 
true, i.e. 

$$\Delta_2 V = 0, \qquad (21'')$$

it being of course understood that $\Delta_2$ refers to the spacelike $dl^2$. 
To a first approximation, as has already been seen, we have 

$$V^2 = 1 - 2\gamma,$$

where $\gamma$ is proportional to the potential of the field, by (9). We 
shall accordingly have to put 

$$V = 1 - \gamma - \psi, \qquad (34)$$

where $\psi$ is to be of higher order than the second. The explicit 
expression of $\Delta_2$ in generic co-ordinates (cf. Chapter VI, p. 154) 
gives in the first place, by (21'') and (34), 

$$\sum_{i=1}^{3} \frac{\partial}{\partial x_i}\left(\sqrt{a}\,\psi^i\right) = -\sum_{i=1}^{3} \frac{\partial}{\partial x_i}\left(\sqrt{a}\,\gamma^i\right). \qquad (21''')$$

From this we have to find $\psi$ to a fourth-order approximation. 
As $\gamma$ is already of the second order, in calculating $\sqrt{a}\,\gamma^i$ 
(from $\gamma^i = \sum_{k=1}^{3} a^{ik} \gamma_k$ and $a$) we need only consider terms as far as 
the second order, i.e. we can use the form (cf. formula (33)) 

$$dl^2 = (1 + 2\gamma)\, dl_0^2.$$

This gives 

$$\sqrt{a} = (1 + 2\gamma)^{3/2}, \qquad \gamma^i = \frac{\gamma_i}{1 + 2\gamma},$$

whence, neglecting terms of higher order than the fourth, 

$$\sqrt{a}\,\gamma^i = \gamma_i\,(1 + \gamma) = \gamma_i + \tfrac{1}{2}\,(\gamma^2)_i.$$

A priori we do not yet know the order $\nu$ (by hypothesis certainly 
higher than the second) of the additional term $\psi$, which 
we have to calculate not only as far as its principal part of order 
$\nu$, but also so as to include additional terms, if any, up to the 
fourth order inclusive. For the moment we shall consider the 
part of order $\nu$. On the left-hand side of (21''') we can substitute 
1 for $\sqrt{a}$, the difference between these two quantities being of 
the second order, which is equivalent to neglecting terms of order 
$\nu + 2$. As $\gamma$ is harmonic, it follows that 

$$\Delta_2^0\left(\psi + \tfrac{1}{2}\,\gamma^2\right) = 0,$$

where $\Delta_2^0$ as before represents Laplace’s operator $\sum_{i=1}^{3} \dfrac{\partial^2}{\partial x_i^2}$, whence 

$$\psi = -\tfrac{1}{2}\,\gamma^2 + \text{a harmonic function}.$$

With suitable hypotheses as to qualitative behaviour, it will 
be seen that the additional harmonic function must vanish, and 
there remains 

$$\psi = -\tfrac{1}{2}\,\gamma^2$$

as the principal term of the function $\psi$. As this is already of the 
fourth order, we can take $-\tfrac{1}{2}\gamma^2$ as the expression for $\psi$ correct 
to the fourth order inclusive. 

To the same order of approximation we get 

$$g_{00} = V^2 = \left(1 - \gamma + \tfrac{1}{2}\,\gamma^2\right)^2 = 1 - 2\gamma + 2\gamma^2. \qquad (35)$$
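The truncation in (35) can be made explicit with exact arithmetic. The sketch below assumes $V = 1 - \gamma - \psi$ with $\psi = -\tfrac{1}{2}\gamma^2$, as found above, squares it, and isolates what is dropped in passing to $1 - 2\gamma + 2\gamma^2$. The sample value of $\gamma$ and the function names are illustrative only.

```python
from fractions import Fraction

def g00_exact(gamma):
    """Square of V = 1 - gamma + gamma**2 / 2, i.e. V = 1 - gamma - psi with psi = -gamma**2 / 2."""
    v = 1 - gamma + gamma ** 2 / 2
    return v * v

def g00_truncated(gamma):
    """Right-hand side of (35)."""
    return 1 - 2 * gamma + 2 * gamma ** 2

gamma = Fraction(1, 1000)  # hypothetical small value of the potential term
residual = g00_exact(gamma) - g00_truncated(gamma)
print(residual == -gamma ** 3 + gamma ** 4 / 4)  # prints True
```

Since $\gamma$ is itself of the second order, the neglected remainder $-\gamma^3 + \tfrac{1}{4}\gamma^4$ is of the sixth order and higher, which is why (35) is complete to the fourth order.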


8. A theorem of mechanical equivalence.¹ 

From the two preceding sections it follows that to a sufficient 
degree of approximation the Einsteinian $ds^2$ which corresponds 
to a statical Newtonian field of potential $U$, fixed in advance, 
is given by 

$$ds^2 = (1 - 2\nu)\, dy_0^2 - (1 + 2\gamma)\, dl_0^2, \qquad (36)$$

where 

$$\gamma = \frac{U}{c^2}, \qquad (37)$$

$$\nu = \gamma - \gamma^2 \qquad (37')$$

(cf. formula (35) in the preceding section). 

In (36) we are satisfied with the first approximation for the 
coefficients of the spacelike $dl_0^2$, while for $V^2$ the part which is 
of the fourth order is also given. This formula is a particular 
case of the $ds^2$ considered on p. 320 of Chapter XI (formula (27)). 
In order to define the motion of a material particle, i.e. the 
geodesics of a space-time of this kind, in accordance with the 
criteria of § 10, p. 320, we note first of all that (36) gives us 


d^ 

ho 




¹ Cf. Levi-Civita, Rend. Acc. Lincei, Series VI, Vol. IV, 1926, pp. 3-6. 

comparing this with the equations (28) on p. 321, and noting 

( <Ko\* 

i® identical with ^ we see that it corresponds 


to the particular case in which the linear form 


Tj vanishes and 

the quadratic form Tj reduces to y^. This brings us back to 
the case considered in § 11, p. 323, the necessary values for 
the symbols then used being 

o 217 , » ^ 


Equation (31) on p. 325 gives 

$$U_1 = U\left(1 + \frac{4E}{c^2}\right) + \frac{3U^2}{c^2}, \qquad (38)$$

which leads to the following theorem: The trajectories of the 
Einsteinian motion coincide to a second approximation with 
those of a Newtonian motion in ordinary Euclidean space for 
which the total energy is still $E$ and the force is derived from the 
potential $U_1$. 

If $t_1$ is the ordinary time in this auxiliary Newtonian problem, 
the corresponding integral of vis viva is 

This integral can be put in a more convenient form for the 
purpose we have in view. From equation (31) on p. 325, 
neglecting terms of higher order, T/i + N, which we shall call 
can be written in the form 

U* = (E7+cV + ^)(l + ^J+x). 

2XJ 

whatever may be the values of tfs and x* In our case, since x > 

0 we shall have 

” *(§)'= • <-> 

Further, we saw in the section referred to that for the 
Einsteinian motion, to the assigned degree of approximation, 
there exists the integral 

4- 2^ + _ (J7 + cV) - E. 

If t is the variable which acts as the time in this problem, 
. Substituting for x ip their values, we can write 

From this and from (39) we get the differential relation 
dl — dty ^1 “J- "^;2 ^ ’ 

when the Newtonian problem is completely solved, this relation 
enables us to find also the law of the time in the Einsteinian motion. 

9. Motion of the planets according to Einstein, to a second 
approximation. Displacement of perihelion. 

The most striking application of the foregoing result is to the 
problem of the motion of the planets round the sun. If we treat 
the planets (as is in fact usually done as a first approximation) 
as material particles with mass so small compared with the sun 
that they do not perceptibly affect the field (or more generally 
the four-dimensional metric associated with the field), then our 
problem is essentially that solved in the preceding section, for 
the particular case in which the function $U$ is the potential of 
a single mass $m_0$ (the sun) which can be taken as the origin $O$ 
of the co-ordinates. We have therefore, as in § 6, 

$$U = \frac{f m_0}{r},$$

where $r$ is the distance between the sun and the planet, measured 
as if the space between the two were rigorously Euclidean. We 
know from the preceding section that as regards the trajectory 
everything happens as if the ordinary mechanics held and the 
planet were acted on by a unitary central force derived from 
the potential (38). This consists of two terms, the first of which, 
$U\left(1 + \dfrac{4E}{c^2}\right)$, corresponds to an attraction inversely proportional 
to $r^2$, of radial component 

$$-\frac{k}{r^2},$$

where for brevity we have put 

$$k = f m_0\left(1 + \frac{4E}{c^2}\right), \qquad (40)$$

and the second to a disturbing force, also central, but inversely 
proportional to $r^3$, of radial component 

$$\frac{3}{c^2}\,\frac{dU^2}{dr} = -\frac{k_1}{r^3},$$

where 

$$k_1 = \frac{6 f^2 m_0^2}{c^2}. \qquad (41)$$

There are thus two modifications of the Newtonian law:
(1) a change in the coefficient of proportionality, which becomes
$f m_0\left(1 + \frac{4E}{c^2}\right)$ instead of $f m_0$; (2) a
disturbing force (of the second order relative to the Newtonian
force) inversely proportional to the cube of the distance, and
therefore of the type already considered by Newton. Now it is
known from the theory of central forces¹ that for motion in a
plane under a force whose radial component is

$$-\frac{h}{r^2} - \frac{K}{r^3},$$

the equation of the orbit in polar co-ordinates r, θ can be put in
the form

$$\frac{p}{r} = 1 + e\cos\alpha\theta \qquad (42)$$

by a suitable choice of the direction of the polar axis, where,
C being the area-constant,

$$p = \frac{\alpha^2 C^2}{h}, \qquad \alpha^2 = 1 - \frac{K}{C^2},$$

and e is a constant of integration, which can always be supposed
positive, θ being if necessary replaced by $\theta + \frac{\pi}{\alpha}$.

¹ See e.g. Levi-Civita and Amaldi: Lezioni di Meccanica Razionale, Vol. II,
p. 200 (Bologna, Zanichelli, 1926); or Lamb: Dynamics, second edition, Chap.
XI, § 91 (Cambridge University Press, 1923).

All this holds generally. Now suppose in particular that
e < 1, denoting elliptic motion as a first approximation (i.e. for
K = 0, so that α reduces to unity). We can also suppose e > 0,
which means that we exclude the case of the circular orbit.
With this limitation, θ in equation (42) can be made to vary
without restriction, and the equation shows that when θ increases
by $\frac{2\pi}{\alpha}$, r again takes the same value. This holds
in particular for the minimum value of r (i.e. perihelion); and
therefore, for two successive passages through perihelion, the
anomalies differ by $\frac{2\pi}{\alpha}$. In the particular case
α = 1 (elliptic orbit with fixed perihelion), the value of this
difference is precisely 2π, so that the difference

$$\sigma = \frac{2\pi}{\alpha} - 2\pi$$

represents, in magnitude and sign, the angular displacement of
perihelion in one revolution. With the value of α given above,
taking into account the smallness of K, we have

$$\sigma = \frac{\pi K}{C^2}.$$

Since for σ, which is already a correction, we need only a first
approximation, we can take for $C^2$ its Newtonian value¹

$$C^2 = f m_0 p = f m_0 a(1 - e^2),$$

where a and e denote respectively the semi-major axis and the
eccentricity of the orbit. Using the value (41) of K we get for
the displacement of perihelion the expression

$$\sigma = \frac{6\pi f m_0}{c^2 a(1 - e^2)}, \qquad (43)$$

which was first calculated by Einstein.

¹ Cf. Levi-Civita and Amaldi: op. cit., p. 212.

In order to adapt the formula to numerical calculation for
any planet, we introduce the mean radius $a_0$ of the earth's orbit,
and write (43) in the form

$$\sigma = 6\pi\,\frac{f m_0}{c^2 a_0}\,\frac{a_0}{a}\,\frac{1}{1 - e^2}.$$

The eccentricity of any planetary orbit being small, we can
at once put $e^2 = 0$. The radius of the orbit being a, we know
that the velocity v in the orbit is given by

$$\frac{v^2}{a} = \frac{f m_0}{a^2},$$

which expresses the equality of the attraction and the centripetal
acceleration. For the earth we have in particular

$$v_0^2 = \frac{f m_0}{a_0},$$

and accordingly (43) becomes

$$\sigma = 6\pi\,\frac{v_0^2}{c^2}\,\frac{a_0}{a}. \qquad (43')$$


The velocity $v_0$ of the earth in its orbit being practically
30 km. per second, and c being 300,000 km. per second, we have
approximately $\frac{v_0}{c} = 10^{-4}$, and therefore

$$\sigma = 6\pi \cdot 10^{-8}\,\frac{a_0}{a}.$$

For Mercury, the planet nearest the sun, and therefore
evidently showing the most perceptible effect, $\frac{a}{a_0} = 0.39$,
which gives for σ a little more than one-tenth of a second. Since
Mercury completes about 420 revolutions in a century, we thus find
for the perihelion of its orbit the centennial displacement of 42",
which corresponds exactly to the difference between the total
observed displacement and the amount predicted by ordinary
celestial mechanics from the Newtonian theory of the perturbations
due to the other planets. It was precisely this residual shift of
about 42" per century which before the birth of the relativity
theory could only be explained by introducing hypothetical
disturbing forces with constants determined ad hoc.

For the other planets, the corresponding calculation naturally
gives a much smaller centennial shift, hardly 8.6" for Venus,
3.8" for the earth, 1.35" for Mars, and still less for the others,
and the results of observation which are at present available are
not accurate enough to provide any basis of comparison with
these figures.
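The arithmetic behind these figures can be retraced with a short sketch of formula (43'). The function names are hypothetical; the values of a/a_0 and of the periods of Venus, the earth and Mars are assumed round modern values and are not taken from the text, which quotes only the resulting shifts.

```python
import math

ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi


def shift_per_revolution(a_ratio, v0_over_c=1.0e-4, e=0.0):
    """Perihelion shift sigma in radians per revolution, from (43').

    a_ratio is a/a0 (orbit radius in units of the earth's); the optional
    eccentricity e restores the factor 1/(1 - e^2) of formula (43).
    """
    return 6.0 * math.pi * v0_over_c ** 2 / (a_ratio * (1.0 - e * e))


def centennial_shift_arcsec(a_ratio, revolutions_per_century, e=0.0):
    """Shift accumulated over a century, in seconds of arc."""
    sigma = shift_per_revolution(a_ratio, e=e)
    return sigma * ARCSEC_PER_RADIAN * revolutions_per_century


# Mercury with the round numbers of the text: a/a0 = 0.39, 420 revolutions.
mercury = centennial_shift_arcsec(0.39, 420)
print("Mercury", round(mercury, 1))

# Assumed values (not from the text): a/a0 and sidereal period in years.
others = {"Venus": (0.723, 0.615), "Earth": (1.0, 1.0), "Mars": (1.524, 1.881)}
for name, (a_ratio, period) in others.items():
    print(name, round(centennial_shift_arcsec(a_ratio, 100.0 / period), 2))
```

With these round inputs the results fall close to the centennial shifts quoted above; the small differences come from the rounding of v_0/c and from putting e² = 0.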

10. Displacement of the spectral lines. Deflection of light. 

In this section we propose to examine the effect of a field
of force on the frequency and the path of light rays. We suppose,
as in the preceding section, that the field is statical, with a
Newtonian potential U, and we consider regions of the field external
to the attracting masses. The effect to a first approximation will
be sufficient for our purpose, and we can consequently assume
that the expression (33) of § 6

$$ds^2 = (1 - 2\gamma)\,dx_0^2 - (1 + 2\gamma)\,dl_0^2, \qquad (33)$$

where γ stands for $\frac{U}{c^2}$, holds for the four-dimensional $ds^2$.

Now suppose that a phenomenon which is predominantly
timelike (e.g. the vibration of an atom) takes place at a specified
point T. If dt is an elementary interval of time in which this
phenomenon is considered, and if within this interval the variations
$dy_i$ of the space co-ordinates are assumed to be negligible,
we shall have from (33) (since $x_0 = ct$)

$$ds_T^2 = (1 - 2\gamma_T)\,c^2\,dt_T^2,$$

where the suffix T denotes that the values in question are
those belonging to the phenomenon at the point T. If the
phenomenon takes place instead at another point S we have
analogously

$$ds_S^2 = (1 - 2\gamma_S)\,c^2\,dt_S^2.$$

Now suppose that we have two identically similar phenomena
at different points, e.g. the emission of light from two atoms
chemically alike and in identical physical conditions. If we
admit that in such a case the space-time interval will be the
same for both, the foregoing formulae will give

$$\frac{dt_S}{dt_T} = \sqrt{\frac{1 - 2\gamma_T}{1 - 2\gamma_S}} = 1 - (\gamma_T - \gamma_S).$$

This differential relation between corresponding times of the
two phenomena under discussion, expressing the constancy of
the ratio $\frac{dt_S}{dt_T}$, naturally implies that the same ratio
exists between any finite pair whatever of corresponding intervals,
$\Delta t_S$ and $\Delta t_T$; in particular, if the phenomenon
considered is periodic, between the respective periods or between
the reciprocals of the frequencies $\nu_S$ and $\nu_T$. We thus have,
neglecting terms of the second order,

$$\frac{\nu_S - \nu_T}{\nu_T} = \gamma_T - \gamma_S = \frac{1}{c^2}(U_T - U_S),$$

which shows that in a gravitational field the variation of the
frequency is of sign opposite to that of the potential; hence, in
particular, there will be a reduction of the frequency for a given
spectral line (and therefore a shift of the line towards the red end
of the spectrum) on passing to a region of higher potential.

By way of example, let us compare two monochromatic light
rays emitted in the same conditions on the earth T and on the
sun S. We can neglect $U_T$ in comparison with $U_S$ and take for
$U_S$ (cf. the preceding section) the value

$$U_S = \frac{f m_0}{r_0},$$

where $r_0$ denotes the sun's radius. As we saw in the preceding
section, we now have

$$\frac{f m_0}{a_0} = v_0^2;$$

the (relative) variation of the frequency, if $\Delta\nu = \nu_S - \nu_T$,
is therefore given by

$$\frac{\Delta\nu}{\nu} = -\frac{U_S}{c^2} = -\frac{v_0^2}{c^2}\,\frac{a_0}{r_0}, \qquad (44)$$

and since in round numbers

$$\frac{v_0}{c} = 10^{-4}, \qquad \frac{a_0}{r_0} = 200,$$

we get

$$\frac{\Delta\nu}{\nu} = -2 \times 10^{-6}.$$
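As a check on the round numbers, formula (44) can be evaluated directly. This is only a sketch: the function name is hypothetical, and the wavelength of 5000 angstroms is an assumed illustrative value that does not appear in the text.

```python
def relative_frequency_shift(v0_over_c, a0_over_r0):
    """Delta nu / nu = -(v0/c)^2 * (a0/r0), formula (44)."""
    return -(v0_over_c ** 2) * a0_over_r0


shift = relative_frequency_shift(1.0e-4, 200.0)

# Assumed illustrative spectral line (not from the text): 5000 angstroms.
wavelength = 5000.0
# To first order, delta lambda / lambda = -delta nu / nu.
delta_wavelength = -shift * wavelength
print(shift, delta_wavelength)
```

The negative sign of the frequency shift corresponds to a displacement of the solar line towards the red, i.e. a small positive change of wavelength.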


It was uncertain for some years whether there did in fact 
exist a shift of this kind towards the red for the solar rays, as 
compared with corresponding rays emitted from a source on the 
earth. The most recent measurements by Perot, Fabry, and St. 
John tend to confirm its existence. 

A more remarkable verification has recently been provided
by St. John, who, following up a suggestion of Eddington's, has
observed analogous displacements in the spectrum of the
Companion of Sirius.


We now pass to the consideration of the path of a light ray
in a field of force. Along any ray we shall have in the first place
$ds^2 = 0$ (cf. Chapter XI, p. 336) and further, the field being
statical (Chapter XI, p. 340), Fermat's principle

$$\delta\int dx_0 = 0$$

will also hold.

Since $ds^2 = 0$, the expression (33) for $ds^2$ gives

$$dx_0^2 = \frac{1 + 2\gamma}{1 - 2\gamma}\,dl_0^2,$$

and therefore, neglecting squares of γ,

$$dx_0 = (1 + 2\gamma)\,dl_0.$$

The rays are therefore defined by the variational equation

$$\delta\int (1 + 2\gamma)\,dl_0 = 0. \qquad (45)$$

At this point we note that in an ordinary Euclidean medium,
isotropic but not homogeneous, of refractive index $\mu(y_1, y_2, y_3)$,
the geometric path of a ray, by Fermat's principle, is
characterized by the variational formula

$$\delta\int \mu\,dl_0 = 0;$$

comparing this with (45) we see that in our field of force, with its
$ds^2$ given by (33), light is propagated as if the space were Euclidean
and filled with a medium of refractive index

$$\mu = 1 + 2\gamma.$$
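The optical analogy lends itself to a quick numerical illustration. The sketch below is not the method of the text: it assumes the usual small-angle approximation in which the bending of a ray is obtained by integrating the transverse gradient of the index μ = 1 + 2γ along the undeflected straight line. The name k = f m_0/c² and the function name are hypothetical. In closed form the integral equals 4k/b for a ray passing at distance b from the centre.

```python
import math


def deflection_from_index(k, b, steps=20000):
    """Bending angle of a ray passing at distance b from the centre.

    The medium has index mu = 1 + 2*k/r with k = f*m0/c**2.  To first order
    the angle is the integral along the straight line y = b of the transverse
    gradient |d mu/dy| = 2*k*b / (x**2 + b**2)**1.5.  With x = b*tan(t) the
    integrand becomes (2*k/b)*cos(t) on the interval (-pi/2, pi/2).
    """
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        t = -math.pi / 2.0 + (i + 0.5) * h  # midpoint rule
        total += (2.0 * k / b) * math.cos(t) * h
    return total


# k/r0 = 2e-6 is the round value of U_S/c^2 used in the text; r0 is the unit of length.
angle = deflection_from_index(k=2.0e-6, b=1.0)
print(angle)
```

The angle is inversely proportional to b, so doubling the distance of closest approach halves the bending.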

This remark becomes even more expressive if we refer once
more to the trajectories of a dynamical problem. In fact, as
we have already had occasion to show in § IJ, pp. 323-325,
the principle of least action leads to the result that the curves
(45), or, what comes to the same thing (multiplying by $c^2$ and
remembering the meaning of γ), those for which

$$\delta\int c^2(1 + 2\gamma)\,dl_0 = \delta\int (c^2 + 2U)\,dl_0 = 0, \qquad (45')$$

can be considered as the trajectories of a material particle in
ordinary space in a field of potential
$\frac{1}{2}c^2(1 + 4\gamma) = \frac{c^2}{2} + 2U$ and with total energy
zero, or, if we prefer, in a field of potential 2U and with total
energy $\frac{c^2}{2}$.


It is interesting to observe that even in the classical mechanics
the mere hypothesis of the materialization of energy leads us to
predict a curved path for rays in a gravitational field. If in fact
we admit that light rays, regarded as lines of flux of energy, are
effectively trajectories of material particles, then each of these
rays (their mutual reactions being supposed negligible) ought
to behave like a free material particle moving under the action
of the force in the field (of potential U) with a velocity which
tends to c at an infinite distance from the attracting masses
(i.e. for U = 0), or, which comes to the same thing, with total
energy $\frac{1}{2}c^2$ per unit mass. It will be seen that general
relativity implies, to a first approximation, solely the substitution
of 2U for U. Now apply these considerations to the path of the rays
in the sun's gravitational field. In accordance with the above
remarks, these rays are to be considered the trajectories in the
problem of the motion of a point attracted by a fixed centre of
force, the potential, with the same notation as before, being

$$2U = \frac{2 f m_0}{r},$$

and the total energy

$$E = \frac{1}{2}c^2.$$

These trajectories are obviously conics with a focus at the
centre of force. The species will depend on the sign of the constant
E; in our case E > 0, so that the curves are hyperbolas. Since
the divergence from a rectilinear path must be very small, it is
self-evident that these hyperbolas will be only very slightly
curved; this can also be proved analytically from the differential
equations. To show this, let n and $\frac{1}{\rho}$ denote the
direction of the principal normal and the curvature at any point
of a ray. Equating the centripetal acceleration to the centripetal
force per unit mass, we have

$$\frac{v^2}{\rho} = 2\,\frac{dU}{dn}.$$

The derivative $\frac{dU}{dn}$ represents the force in the field in the
direction n, and cannot therefore be greater than the intensity
$\frac{f m_0}{r^2} = \frac{U}{r}$ of this force. Further, the integral
of vis viva

$$\frac{1}{2}v^2 = \frac{1}{2}c^2 + 2U = \frac{1}{2}c^2(1 + 4\gamma)$$

shows that if we neglect terms of the second order, v may be
taken as equal to c. Consequently we have

$$\frac{1}{\rho} \le \frac{2 f m_0}{c^2 r^2}.$$

If $r_0$ is the sun's radius, the maximum possible value for the
force $\frac{2 f m_0}{r^2}$ in the space traversed by the light rays is
evidently that given by $r = r_0$; the above inequality can therefore
be written

$$\frac{1}{\rho} \le \frac{2 f m_0}{c^2 r_0^2}.$$

Since $\frac{f m_0}{r_0}$ is the value $U_S$ of the potential at the
surface of the sun, and the value of the ratio $\frac{U_S}{c^2}$ is
$2 \times 10^{-6}$ (cf. formula (44)), we get finally

$$\frac{r_0}{\rho} \le 4 \times 10^{-6}.$$


In other words, if the radius of curvature ρ is not infinite as
for a straight line, it is at any rate of the order of a million
times the sun's radius.

It is therefore perfectly legitimate to assume that the rays are
in any case only very slightly bent, even if they pass very close
to the sun; in every case, therefore, the hyperbola in question will
have its asymptotes OA', OT' (cf. fig. 4) almost in one straight line.

[Fig. 4]

Consider in further detail the hyperbolic ray which grazes the
solar sphere at V. Let O be the centre of the hyperbola, S the
centre of the sun and therefore the focus of the given branch of
the hyperbola. V will be its vertex, and, if a denotes the transverse
semi-axis and e the eccentricity, we shall have by definition

$$OV = a, \qquad OS = ae, \qquad SV = r_0 = a(e - 1).$$

We know also from analytical geometry that if δ represents
the exterior angle between the two asymptotes

$$\sin\frac{\delta}{2} = \frac{1}{e}.$$

In the case we are considering, δ must be very small; hence,
from this formula, e is very large. Our results will be quite
sufficiently accurate if we take the sine of the angle $\frac{\delta}{2}$
as equal to the arc, and consider $\frac{1}{e}$ negligible in comparison
with unity. Thus we can write

$$\delta = \frac{2}{e} = \frac{2}{e - 1}\left(1 - \frac{1}{e}\right) = \frac{2}{e - 1}.$$

Using the relation $r_0 = a(e - 1)$, we get finally as the
measure of δ in terms of the two lengths $r_0$ and a

$$\delta = \frac{2a}{r_0}. \qquad (46)$$


In the classical theory, the transverse semi-axis a in the
hyperbolic motion due to the Newtonian attraction of a mass
M is connected with the constant E of the vis viva by the relation

$$E = \frac{fM}{2a}.$$

Putting for E its value $\frac{1}{2}c^2$, and noting that in our case
$M = 2m_0$, this gives a, and (46) becomes

$$\delta = \frac{4 f m_0}{c^2 r_0}, \qquad (46')$$

and therefore, using the numerical value already found for this
expression,

$$\delta = 8 \times 10^{-6}.$$

The right-hand side is a pure number, which gives the angle
δ in radians. In seconds

$$\delta = 1.7''. \qquad (47)$$
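The size of the approximation made in passing from the exact relation sin(δ/2) = 1/e to formula (46) can be checked numerically. The sketch below works in units of the sun's radius and takes the round value 2 × 10⁻⁶ for f m_0/(c² r_0); the function name is hypothetical.

```python
import math

ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi


def grazing_deflection(u_over_c2):
    """Return (exact, approximate) deflection in radians for a grazing ray.

    u_over_c2 is f*m0/(c**2 * r0), the value of U_S/c**2 at the sun's surface.
    Lengths are measured in units of r0.
    """
    a = 2.0 * u_over_c2               # from E = f*M/(2a) with E = c^2/2, M = 2*m0
    e = 1.0 + 1.0 / a                 # from r0 = a*(e - 1) with r0 = 1
    exact = 2.0 * math.asin(1.0 / e)  # sin(delta/2) = 1/e
    approx = 2.0 * a                  # formula (46): delta = 2a/r0
    return exact, approx


exact, approx = grazing_deflection(2.0e-6)
print(exact * ARCSEC_PER_RADIAN, approx * ARCSEC_PER_RADIAN)
```

The two values differ only by a relative amount of order 1/e, which is a few parts in a million here, so the simplification leading to (46) is harmless at this accuracy.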

It will at once be seen that this angle δ gives the measure of
the deflection, i.e. the maximum angular deviation to which a
stellar ray can be subjected by the sun's gravitational action.
Suppose in fact that we are considering a ray of light which starts
from a star A and arrives at a terrestrial observer after describing
an arc of a hyperbola which grazes the solar sphere at V, as in
fig. 4. The direction of the hyperbola at T, along which the
observer receives the light ray, is indistinguishable from that of
the asymptote OT'; the direction in which the light left the star
is that of the tangent at A, which in its turn is indistinguishable
from the other asymptote A'O, so that the deflection is the
exterior angle between A'O and OT', i.e. δ.

The direction A'O will naturally be identified with the
direction in which T sees the star in normal conditions, i.e. when
the sun leaves the earth-star direction and the corresponding
gravitational perturbation becomes imperceptible, so that the
visual ray again becomes rectilinear (or so nearly rectilinear that
the difference is absolutely imperceptible).

It may be well to point out that if the visual ray from a star
does not graze the solar sphere but passes at a distance $r > r_0$
from the centre of the sun, the deflection diminishes, being in
inverse ratio to the perihelion distance r. This can be seen as
follows. The expression (46) for δ naturally holds for any star
whatever which is visible from the earth, provided $r_0$ is replaced
by the perihelion distance r. We shall thus have

$$\delta = \frac{2a}{r} = \frac{2a}{r_0}\,\frac{r_0}{r}.$$

The factor $\frac{2a}{r_0}$ has been calculated above, so that we have
finally

$$\delta = 1.7'' \times \frac{r_0}{r}.$$

Since $r_0$ corresponds to an angle of 16', it will be obvious that
if the angular distance from the centre of the sun is even a few
degrees δ will not be more than some hundredths of a second,
and will therefore be totally imperceptible, just as if the ray
were rigorously rectilinear.
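The fall-off with angular distance can be tabulated in a few lines. This sketch assumes, as the text does implicitly, that the ratio r_0/r may be replaced by the ratio of the corresponding angular distances seen from the earth; the function name and the sample angles are hypothetical.

```python
def deflection_arcsec(angular_distance_arcmin, grazing=1.7, solar_radius_arcmin=16.0):
    """delta = grazing * r0/r, with r0/r taken as the ratio of angular distances."""
    if angular_distance_arcmin < solar_radius_arcmin:
        raise ValueError("the ray would have to pass through the sun")
    return grazing * solar_radius_arcmin / angular_distance_arcmin


for degrees in (1, 2, 5, 10):
    print(degrees, round(deflection_arcsec(60.0 * degrees), 3))
```

At five degrees from the centre of the sun the deflection is already below one-tenth of a second, in agreement with the remark above.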

The angular displacements, if any, due to the sun become 
capable of observation during a total eclipse. A first attempt in 
this direction was made by the Lick Observatory in 1918, but 
the precision of the observations was insufficient for the purpose. 

For the total eclipse of 29th May, 1919, two simultaneous 
expeditions were organized by the Royal Society of London: 
one for Sobral in the north of Brazil, the other for the island of 
Principe in the Gulf of Guinea, both localities being within the 
zone of totality of the eclipse. The results of the observations 
made by these two expeditions can be summarized as follows. 
For the deflection of light the mean value of the displacements
observed at Sobral gave 1.98", with probable error ± 0.12";
at Principe the mean value was 1.61", with probable error 0.30".
The deflection 1.76" predicted by Einstein's general relativity
lies between these two. This provided a new and striking
confirmation of Einstein's theory, as the observed results were
definitely incompatible both with the zero deviation of geometrical
optics, and with the deviation of half this value (0.88")
which would be given by the ordinary theory combined with the
simple postulate that mass and energy are proportional.

On the occasion of the next total eclipse (21st September,
1922), visible in Western Australia, three further expeditions
started for the zone of totality; the American one, organized by
the Lick Observatory and conducted by Campbell, was the only
one to secure any useful observations. But the available stars
were rather far from the limb of the sun, and the deflection was
therefore small; the results¹ show a wide dispersion, so that
many astronomers do not regard their mean value as a further
confirmation of the theory, although it is in almost perfect
agreement with the Einsteinian prediction.

11. Three-dimensional metrics with spherical symmetry. 

We shall begin by defining what is meant by saying that a
metric manifold $V_3$ has spherical symmetry round one of its
points O. We shall follow the geometrical method suggested by
Palatini², considering along with the $V_3$ an ordinary Euclidean
space $V_3'$ in one-to-one correspondence with it. This
correspondence being established, any point-transformation (T) of
$V_3'$ into itself (in particular, a rigid motion of $V_3'$) gives rise
to an analogous point-transformation of $V_3$ into itself. There is,
however, no a priori reason that a rigid motion of $V_3'$ should
correspond to a rigid motion of $V_3$, a rigid motion of a manifold
being taken to mean any transformation which leaves $dl^2$
unchanged, and therefore, in particular, changes geodesics into
geodesics.

We shall now say that a metric manifold $V_3$ has spherical
symmetry round one of its points O when each of the $\infty^3$ rigid
rotations of $V_3'$ round the corresponding point O' determines a
rigid motion in $V_3$.

Some important properties of the metric of a $V_3$ with this
property follow easily from the definition, subject naturally to
the obvious condition that the metric (i.e. the coefficients of $dl^2$)
is regular in the region round every point, except possibly the
point O. It can at once be shown that to any ray j' drawn from
O' there corresponds in $V_3$ a geodesic j drawn from O. Thus, let
P' be any point on j' which is not O', P the corresponding point
(which is therefore not O) in $V_3$. Let g be the geodesic in $V_3$
which is tangential to j at P; from the qualitative hypotheses of
the case it follows that g exists and is unique. We have to show
that g coincides with j.

¹ Published in the Lick Observatory Bulletin, No. 346, 1923.

² Cf. "Lo spostamento del perielio di Mercurio, ecc." in Nuovo Cimento, XIV
(1917), pp. 12-54.

Consider in $V_3'$ the $\infty^1$ rotations which have j' for axis:
these correspond to $\infty^1$ rigid motions in the space $V_3$ which
leave fixed all the points of j, and only these. If we suppose that
g is distinct from j, the effect of the $\infty^1$ rotations round j' would
be that g would occupy a simple infinity of positions, retaining
in each the properties of being geodesic and tangential to j at
P; we should therefore have an infinite number of geodesics
drawn through P in the same direction, which is impossible;
hence g must coincide with j.

An obvious deduction is that to any spherical surface Σ'
with centre O' there corresponds in $V_3$ a geodesic sphere Σ with
centre O.

Now consider any pair Σ, Σ' of these surfaces, and the
correspondence between the points Q of one and Q' of the other
determined by the correspondence between the two spaces. We
wish to show that the correspondence between Q and Q' is
conformal.

Let dσ' be a generic line element in Σ' drawn from Q', dσ the
homologous element drawn from Q. If we suppose the Euclidean
space referred to polar co-ordinates r, θ, φ, we shall have

$$d\sigma'^2 = r^2(d\theta^2 + \sin^2\theta\,d\varphi^2),$$

where r = O'Q'. Further, when r, θ, φ are known, they determine
Q', and therefore also Q, from the one-to-one correspondence;
θ and φ can therefore also be regarded as curvilinear co-ordinates
of Q on Σ, and the line element $d\sigma^2$ corresponding to dσ' (i.e. to
arbitrary differentials dθ and dφ) will in every case be
represented by a quadratic form which we propose to find.

Consider two elementary arcs dσ' of equal length, drawn
from Q' in two different directions. The two homologous arcs
dσ will also be equal. For the two arcs dσ', being equal in length,
can be obtained from one another by a rotation round O'Q';
hence we infer that the two arcs dσ can also be obtained from one
another by a rigid motion in $V_3$, and are therefore of equal length
with respect to the metric of $V_3$.

It follows that the ratio $\frac{d\sigma}{d\sigma'}$ is the same for the two
directions considered, or, in other terms, that this ratio is the same
whatever the differentials dθ and dφ may be. It is therefore a
function H of position alone, i.e. (a priori) of r, θ, φ, but it will
at once be seen that this function must be the same whatever may
be the point Q' of Σ' considered, since we can always pass from one
Q' to another by a rotation. We can therefore put

$$d\sigma^2 = H^2\,d\sigma'^2,$$

where H denotes a function of r only.

For what follows it is perhaps advantageous to replace the
co-ordinate r (the radius vector in $V_3'$) by a function R(r) defined
by the equation

$$R(r) = H(r)\,r. \qquad (48)$$

The square of the line element of the geodesic sphere Σ thus
takes the form

$$d\sigma^2 = R^2(d\theta^2 + \sin^2\theta\,d\varphi^2); \qquad (49)$$


this gives us the geometrical significance of R, no longer in the
auxiliary Euclidean metric, but directly in $V_3$. In fact, the
expression (49) for $d\sigma^2$ is that for a sphere of radius R in ordinary
space, and as such (cf. § 7, p. 240) has Gaussian curvature
$K = \frac{1}{R^2}$; this curvature, from its intrinsic nature, belongs to
any surface whose line element is given by (49), and therefore, in
particular, to our surface Σ.

We can therefore attach the following significance to the
co-ordinate R: $\frac{1}{R^2}$ represents at any point the Gaussian
curvature of the geodesic sphere with its centre at the centre of
symmetry O and passing through the point. From the property of
symmetry it follows at once that all the geodesics drawn from O cut
the sphere Σ orthogonally; hence if we denote by dg the elementary
arc of one of these geodesics, the $dl^2$ of $V_3$ can be represented
in the form

$$dl^2 = dg^2 + d\sigma^2,$$

and since dg depends solely on R (also from symmetry) we can
put

$$dg = A(R)\,dR,$$

where A is a function of R, a priori undetermined, so that we
get in consequence, with the help of (49),

$$dl^2 = A^2 dR^2 + R^2(d\theta^2 + \sin^2\theta\,d\varphi^2). \qquad (49')$$

This is the most general expression for the $dl^2$ of a $V_3$ which is
symmetrical round a point.¹

It is not without interest to show that every $V_3$ of this kind
can be conformally represented in Euclidean space. It will be
sufficient to show that we can determine two functions H(r)
and r(R) such that we have identically

$$A^2 dR^2 + R^2(d\theta^2 + \sin^2\theta\,d\varphi^2) = H^2\{dr^2 + r^2(d\theta^2 + \sin^2\theta\,d\varphi^2)\};$$

the necessary and sufficient conditions for this are

$$Hr = R, \qquad H\,dr = A\,dR,$$

and therefore, eliminating H,

$$\frac{dr}{r} = A\,\frac{dR}{R}. \qquad (50)$$

When A is a known function of R, this determines r, except
for a constant multiplier, which from the strictly geometrical
point of view remains arbitrary. The modulus H of the conformal
transformation is then defined by

$$H = \frac{R}{r}. \qquad (51)$$
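Equation (50) is a simple quadrature, and a concrete case may help to fix ideas. The sketch below is an illustration under an assumed choice of A that does not come from the text: the space of constant curvature 1, for which A = 1/√(1 − R²). The integral is evaluated by Simpson's rule and compared with the closed form r = R/(1 + √(1 − R²)), in which the arbitrary constant multiplier has been fixed by the (equally assumed) normalisation r/R → 1/2 as R → 0. All function names are hypothetical.

```python
import math


def A(R):
    """Assumed example (not from the text): unit 3-sphere, A = 1/sqrt(1 - R^2)."""
    return 1.0 / math.sqrt(1.0 - R * R)


def log_r_ratio(R1, R2, n=2000):
    """Simpson's rule for the integral of A(R)/R from R1 to R2, i.e. log(r2/r1) by (50)."""
    if n % 2:
        n += 1
    h = (R2 - R1) / n
    f = lambda R: A(R) / R
    total = f(R1) + f(R2)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(R1 + i * h)
    return total * h / 3.0


def r_closed_form(R):
    """Closed-form solution of (50) for this A, with r/R -> 1/2 as R -> 0."""
    return R / (1.0 + math.sqrt(1.0 - R * R))


R1, R2 = 0.2, 0.8
numeric = math.exp(log_r_ratio(R1, R2))
exact = r_closed_form(R2) / r_closed_form(R1)
H = R2 / r_closed_form(R2)  # modulus (51) at R2
print(numeric, exact, H)
```

Only the ratio r(R2)/r(R1) is compared, since (50) leaves a constant multiplier of r undetermined.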


We shall now calculate Ricci's symbols $\alpha_{ik}$ (Chapter VII,
p. 199) relative to a metric of this kind. We again make use of
the property of symmetry, noting that an obvious consequence
of the considerations set out in § 12, pp. 201-208 is that if the
quadric which determines the local distribution of curvature has
an axis of symmetry, this axis gives one of the three principal
directions, while the other two are indeterminate (i.e. may be
any pair of directions orthogonal to each other and to the axis
of symmetry). In our case, a point P distinct from O having been
fixed arbitrarily, and the behaviour of every metric property
being symmetrical round the geodesic g which joins O and P,
it follows that the quadric of curvatures at P is necessarily
symmetrical round the direction of g. Hence at every point
our co-ordinates r, θ, φ give principal directions of curvature,
from which it follows at once that in the quadric of curvatures,
and therefore in the tensor $\alpha_{ik}$, the product terms are missing, i.e.

$$\alpha_{ik} = 0 \quad \text{for } i \neq k.$$

In addition, if $\omega_1$ is the principal curvature corresponding
to g, the other two curvatures $\omega_2$, $\omega_3$ are equal to one another;
we shall denote their common value by ω.

¹ This formula had been given as early as 1896, from analytical considerations
based on the theory of groups. Cf. Atti della R. Acc. dei Lincei, Vol. V (second
half-year, 1896), pp. 164-171.

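We may now recall formula (47) on p. 207, viz.

$$\alpha_{ik} = \sum_{h=1}^{3} \omega_h\,\lambda_{h|i}\,\lambda_{h|k},$$

which gives explicitly all the α's as functions of the curvatures
and of the moments of the principal lines. Since these coincide
with the co-ordinate lines, along which vary R alone, θ alone,
and φ alone, respectively, they will have for parameters

$$\lambda_1^1 = \frac{dR}{dl} = \frac{1}{A}, \qquad \lambda_1^2 = 0, \qquad \lambda_1^3 = 0;$$
$$\lambda_2^1 = 0, \qquad \lambda_2^2 = \frac{d\theta}{dl} = \frac{1}{R}, \qquad \lambda_2^3 = 0;$$
$$\lambda_3^1 = 0, \qquad \lambda_3^2 = 0, \qquad \lambda_3^3 = \frac{d\varphi}{dl} = \frac{1}{R\sin\theta};$$

and therefore the moments will be

$$\lambda_{1|1} = A, \qquad \lambda_{1|2} = 0, \qquad \lambda_{1|3} = 0;$$
$$\lambda_{2|1} = 0, \qquad \lambda_{2|2} = R, \qquad \lambda_{2|3} = 0;$$
$$\lambda_{3|1} = 0, \qquad \lambda_{3|2} = 0, \qquad \lambda_{3|3} = R\sin\theta.$$

Substituting in the formula quoted above, and putting
$\omega_2 = \omega_3 = \omega$, we get

$$\alpha_{11} = A^2\omega_1, \qquad \alpha_{22} = R^2\omega, \qquad \alpha_{33} = \alpha_{22}\sin^2\theta, \qquad (52)$$

$$\alpha_{ik} = 0 \quad (i \neq k). \qquad (53)$$

The equations (53) have already been obtained from the
consideration that in our case the principal lines of curvature
coincide with the co-ordinate lines.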

We shall now calculate explicitly the value of ω at a generic
point P from its definition as the Riemannian curvature. From
symmetry, it can be considered as belonging to any geodesic
surface whatever with pole P and containing the direction R.
We shall show that the surface φ = constant is a particular case
of such a surface. Take the differential equations of the geodesics
in our $V_3$ (of line element dl), not, however, in the form given
in (47) on p. 134, where they are solved for the second derivatives
of the variables, which would require the calculation of
Christoffel's symbols, but in Lagrange's parametric form, starting
from the Lagrangian function (the vis viva)

$$T = \frac{1}{2}\,\frac{dl^2}{dt^2}.$$

In the case we are considering

$$T = \frac{1}{2}\{A^2\dot{R}^2 + R^2(\dot\theta^2 + \sin^2\theta\,\dot\varphi^2)\}$$

(where a dot over a letter denotes differentiation with respect to
the parameter t), and therefore

$$\frac{\partial T}{\partial\dot\varphi} = R^2\sin^2\theta\,\dot\varphi, \qquad \frac{\partial T}{\partial\varphi} = 0.$$

From Lagrange's equation for the angle φ, viz.

$$\frac{d}{dt}\frac{\partial T}{\partial\dot\varphi} - \frac{\partial T}{\partial\varphi} = 0,$$

it follows on integrating that one of the equations of the geodesics
has the form

$$R^2\sin^2\theta\,\dot\varphi = \text{constant}.$$

From this it follows that if a geodesic issuing from P touches
initially the surface φ = constant (so that $\dot\varphi = 0$ at P),
$\dot\varphi$ vanishes along the whole geodesic, which therefore belongs to
the surface φ = constant passing through P, as we wished to
prove. To find ω, we have therefore to find the curvature of the
binary differential form

$$A^2 dR^2 + R^2 d\theta^2, \qquad (54)$$

which expresses the square of the line element of the surface
φ = constant.

The general expression for this curvature is

$$\omega = \frac{(12, 12)}{a}$$

(formula (28), p. 194); as our a is $A^2R^2$, it only remains to
calculate Riemann's symbol of the first kind, (12, 12), by means
of formulae (3) and (5) of Chapter VII. The explicit expression
for this was formed by Gauss, and is given in all treatises on
the subject. We thus get

$$\omega = -\frac{1}{AR}\,\frac{d}{dR}\left(\frac{1}{A}\right) = -\frac{1}{2R}\,\frac{d}{dR}\left(\frac{1}{A^2}\right).$$

For the curvature $\omega_1$ we find

$$\omega_1 = \frac{1}{R^2}\left(1 - \frac{1}{A^2}\right).$$

An independent calculation of these expressions is given in
the following section.
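The two expressions can be tried out numerically on simple cases. The sketch below is illustrative only: the derivative of 1/A is approximated by a central difference, and the two test functions A (flat space and a space of constant curvature 1/4) are assumed examples not taken from the text. For a space of constant curvature both ω and ω_1 should equal that curvature.

```python
import math


def curvatures(A, R, h=1.0e-5):
    """Return (omega, omega_1) for dl^2 = A(R)^2 dR^2 + R^2 (dtheta^2 + sin^2 theta dphi^2).

    omega   = -(1/(A R)) d(1/A)/dR   (central difference for the derivative)
    omega_1 = (1/R^2) (1 - 1/A^2)
    """
    d_inv_A = (1.0 / A(R + h) - 1.0 / A(R - h)) / (2.0 * h)
    omega = -d_inv_A / (A(R) * R)
    omega_1 = (1.0 - 1.0 / A(R) ** 2) / R ** 2
    return omega, omega_1


# Assumed test cases, not taken from the text.
flat = lambda R: 1.0                                   # Euclidean space
sphere = lambda R: 1.0 / math.sqrt(1.0 - R * R / 4.0)  # 3-sphere of radius 2

print(curvatures(flat, 0.7))
print(curvatures(sphere, 0.7))
```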

12. Digression on the calculation of curvatures. 

While our specific object is the calculation of ω and $\omega_1$, we
may here, for the convenience of the reader, show how the
explicit expression for the curvature of a binary form as a function
of its coefficients can be obtained without calculating Christoffel's
symbols.¹ We shall start from the geometrical property of the
curvature expressed by formula (29') on p. 195, viz.

$$K = \frac{\varepsilon}{\Delta T},$$

where ΔT denotes the area of an infinitesimal circuit T containing
P, and ε represents the angle of parallelism. In order to
reduce the calculation to a minimum, we shall calculate ε with
reference to a $dl^2$ of orthogonal form, of the type

$$E\,dx_1^2 + G\,dx_2^2. \qquad (55)$$

¹ Cf. F. Sbrana: Rend. della R. Acc. dei Lincei, Vol. XXXIII (second
half-year, 1924), pp. 236-238.

If on leaving P the direction λ which is being displaced makes
an angle α with the co-ordinate line $x_1$, its parameters $\lambda^1$,
$\lambda^2$ are plainly given by

$$\lambda^1 = \frac{\cos\alpha}{\sqrt{E}}, \qquad \lambda^2 = \frac{\sin\alpha}{\sqrt{G}} \qquad (56)$$


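(cf. §§ 4 and 7, pp. 92, 98). Now consider an infinitesimal
displacement δP, of contravariant components $\delta x_1$, $\delta x_2$; we know
that when λ is given a parallel displacement along δP the
increments $\delta\lambda^i$ of its parameters are given by

$$\delta\lambda^i = -\sum_{j,l=1}^{2} \{jl, i\}\,\lambda^j\,\delta x_l \qquad (i = 1, 2) \qquad (57)$$

(formula (23), p. 110). In order to avoid the necessity of
calculating the coefficients on the right-hand side (Christoffel's
symbols), we note that the equations

$$\ddot{x}_i = -\sum_{j,l=1}^{2} \{jl, i\}\,\dot{x}_j\,\dot{x}_l$$

of the geodesics have on the right-hand side quadratic forms
whose coefficients are precisely the symbols we need. Further,
the first of the Lagrangian equations of the geodesics
corresponding to the form (55) (the equation relative to $x_1$) is

$$\frac{d}{dt}\frac{\partial T}{\partial\dot{x}_1} - \frac{\partial T}{\partial x_1} = 0,$$

where $T = \frac{1}{2}(E\dot{x}_1^2 + G\dot{x}_2^2)$, or, performing the
differentiations and solving for $\ddot{x}_1$,

$$\ddot{x}_1 = -\left\{\frac{E_1}{2E}\,\dot{x}_1^2 + \frac{E_2}{E}\,\dot{x}_1\dot{x}_2 - \frac{G_1}{2E}\,\dot{x}_2^2\right\},$$

where $E_1$, $E_2$, $G_1$, $G_2$ represent derivatives of E and G with
respect to $x_1$ and $x_2$. Comparing this with the first of the equations
(57), we see that the latter can be written

$$\delta\lambda^1 = -\frac{E_1}{2E}\,\lambda^1\delta x_1 - \frac{E_2}{2E}(\lambda^1\delta x_2 + \lambda^2\delta x_1) + \frac{G_1}{2E}\,\lambda^2\delta x_2$$
$$\qquad = -\lambda^1\,\delta\log\sqrt{E} + \lambda^2\left(-\frac{E_2}{2E}\,\delta x_1 + \frac{G_1}{2E}\,\delta x_2\right).$$

But from (56) we get

$$\delta\lambda^1 = -\sqrt{\frac{G}{E}}\,\lambda^2\,\delta\alpha - \lambda^1\,\delta\log\sqrt{E},$$

and substituting from this in the preceding equation there
results

$$\delta\alpha = \frac{1}{2\sqrt{EG}}\,(E_2\,\delta x_1 - G_1\,\delta x_2).$$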

The angle of parallelism is obtained by integrating Sa round 
the circuit T, Replacing the line integral by a surface integral 
in the usual way (for the signs, cf. footnote, p. 190, Chapter VII) 
we get 

Noting that the field of integration reduces to the infinitesimal 
element 

DP ~ EG dx^ dx^, 

we can write (neglecting infinitesimals of higher order) 

c = - ^ [ -L (-^)+ ® 

2*s/eG WEO/ WEG/J 

This gives the required expression for the curvature, viz. 

ir=- 1 / 

'2^ EG 19 WEG/ WEG/ J 
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For E = \ (for which the lines are geodesics) we get in 
particular the formula 

£: = — _1 
s/G 

which is frequently used in the theory of surfaces. 
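The curvature formula for an orthogonal line element can be checked numerically. The sketch below is an illustration only, not part of the original treatment: the function name, the step size and the test surface (a sphere of radius 2 with $x_1$ the colatitude) are assumptions, and the derivatives are replaced by central differences.

```python
import math

def gaussian_curvature(E, G, x1, x2, h=1e-3):
    """K = -1/(2 sqrt(EG)) [ d/dx1 (G_1/sqrt(EG)) + d/dx2 (E_2/sqrt(EG)) ]."""
    def root(u, v):
        return math.sqrt(E(u, v) * G(u, v))

    def p(u, v):  # G_1 / sqrt(EG), with G_1 by central difference
        return (G(u + h, v) - G(u - h, v)) / (2 * h) / root(u, v)

    def q(u, v):  # E_2 / sqrt(EG), with E_2 by central difference
        return (E(u, v + h) - E(u, v - h)) / (2 * h) / root(u, v)

    dp = (p(x1 + h, x2) - p(x1 - h, x2)) / (2 * h)
    dq = (q(x1, x2 + h) - q(x1, x2 - h)) / (2 * h)
    return -(dp + dq) / (2 * root(x1, x2))

# Sphere of radius a: E = a^2, G = a^2 sin^2(x1); the curvature should be 1/a^2.
a = 2.0
K_sphere = gaussian_curvature(lambda th, ph: a * a,
                              lambda th, ph: (a * math.sin(th)) ** 2, 1.0, 0.3)

# Euclidean plane in polar co-ordinates: E = 1, G = x1^2; the curvature should vanish.
K_plane = gaussian_curvature(lambda r, th: 1.0, lambda r, th: r * r, 1.5, 0.2)
```

The two test metrics are chosen because their curvatures are known in advance, so any sign error in the formula would show up at once.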

For the line element given by (54), putting $x_1 = R$, $x_2 = \theta$, so that $E = A^2(R)$, $G = R^2$, the curvature $K$ becomes

$$K = -\frac{1}{AR}\,\frac{d}{dR}\left(\frac{1}{A}\right), \tag{58'}$$

as stated in the preceding section.

We now come to the calculation of $\omega_1$, the curvature corresponding to the section normal to the lines $R$. It is to be noted that the spheres $R$ = constant, unlike the surfaces $\phi$ = constant, are not geodesic surfaces, so that $\omega_1$ does not coincide with the Gaussian curvature $1/R^2$ (cf. § 11) of these spheres. To calculate it, instead of using the direct definition it will be more convenient to use the property that $dl^2$ can be conformally represented in a Euclidean space, with

$$dl^2 = H^2\,dl_0^2,$$

as we have already seen.

In § 4, p. 228, we found the explicit form of the relations between homologous Riemann's symbols for two line elements $ds$ and $ds'$ for which

$$ds'^2 = e^{2\tau}\,ds^2.$$

We shall identify $ds^2$ with our $dl^2$ and $ds'^2$ with the Euclidean $dl_0^2$; we can then apply formulae (18) of p. 231 by making the symbols marked with a dash vanish (since they refer to the Euclidean $dl_0^2$) and putting

$$\tau = -\log H. \tag{59}$$

The formulae then become 

{ij, hk) = — TjTj.) — — t^t*) — — t, t,,) 

+ — T- T*) + (aj, — a^c aj/,) At,

where the coefficients, the covariant derivatives, and the parameter $\Delta\tau$ are all taken to refer to

$$dl^2 = A^2\,dR^2 + R^2\left(d\theta^2 + \sin^2\theta\,d\phi^2\right).$$

Multiplying these formulae by $a^{ik}a^{jh}$ and summing with respect to the four indices, the left-hand side, by formulae (1) and (2) of § 2, gives the linear invariant $G$ relative to our $V_3$, which after some obvious reductions is thus expressed by

$$G = -4\,\Delta_2\tau - 2\,\Delta\tau,$$

where $\Delta\tau = \sum_{i,k=1}^{3} a^{ik}\tau_i\tau_k$ (Chapter VIII, p. 231); and $\Delta_2\tau = \sum_{i,k=1}^{3} a^{ik}\tau_{ik}$ (Chapter VI, p. 154).

But for a $V_3$ the linear invariant $G$ is equal to $-2M$ (cf. § 4), so that the mean curvature is in our case given by

$$M = 2\,\Delta_2\tau + \Delta\tau. \tag{60}$$

As $M$ (the sum of the three curvatures) $= \omega_1 + 2\omega$, and $\omega$ has already been calculated, this formula will give the required expression for $\omega_1$; it remains to find the values of the quantities $\Delta_2\tau$ and $\Delta\tau$ on the right, using for this purpose the formulae (50), (51), and (59).

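From the general expression

$$\Delta\tau = \sum_{i,k=1}^{3} a^{ik}\tau_i\tau_k$$

we have for our $dl^2$, and for a function $\tau$ which depends only on $R$,

$$\Delta\tau = \frac{1}{A^2}\,\tau'^2,$$

the dash here also denoting the derivative with respect to $R$. Further, from the general expression (18) on p. 154 we have

$$\Delta_2\tau = \frac{1}{\sqrt{a}}\sum_{i=1}^{3}\frac{\partial}{\partial x_i}\left(\sqrt{a}\,\tau^i\right),$$

and since now $\sqrt{a} = AR^2\sin\theta$, it follows that

$$\Delta_2\tau = \frac{1}{AR^2}\,\frac{d}{dR}\left(\frac{R^2\tau'}{A}\right).$$

In our case, from (59) and (51),

$$\tau = -\log H = \log\frac{r}{R},$$

and by (50)

$$\frac{d\log r}{dR} = \frac{A}{R};$$

hence

$$\tau' = \frac{A - 1}{R}.$$

It follows that

$$\Delta\tau = \frac{(A-1)^2}{A^2R^2} = \frac{1}{R^2}\left(1 - \frac{1}{A}\right)^2,$$

$$\Delta_2\tau = \frac{1}{AR^2}\,\frac{d}{dR}\left[R\left(1 - \frac{1}{A}\right)\right] = \frac{1}{AR^2}\left(1 - \frac{1}{A}\right) - \frac{1}{AR}\,\frac{d}{dR}\left(\frac{1}{A}\right);$$

using the expression (58') for $\omega$, this can also be written in the form

$$\Delta_2\tau = \frac{1}{AR^2}\left(1 - \frac{1}{A}\right) + \omega.$$

Substituting in (60) the expressions just found for $\Delta\tau$ and $\Delta_2\tau$, and for $M$ its value $\omega_1 + 2\omega$, $2\omega$ cancels out on both sides, and we get for $\omega_1$ the value stated at the end of § 11, viz.

$$\omega_1 = \frac{1}{R^2}\left(1 - \frac{1}{A^2}\right). \tag{61}$$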
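The cancellation that leads to (61) can be checked numerically for any smooth coefficient $A(R)$. The following sketch is illustrative only: the sample profile `A`, the helper names and the finite-difference step are assumptions, not part of the original argument. It forms $\Delta\tau$, $\Delta_2\tau$ and $\omega$ from their definitions and compares $2\Delta_2\tau + \Delta\tau - 2\omega$ with the closed form (61).

```python
def A(R):
    # Arbitrary smooth sample profile (an assumption of this sketch).
    return 1.0 + 1.0 / (1.0 + R * R)

def d(f, R, h=1e-4):
    # Central-difference derivative with respect to R.
    return (f(R + h) - f(R - h)) / (2 * h)

def tau_prime(R):
    # tau' = (A - 1)/R, from (50), (51), (59).
    return (A(R) - 1.0) / R

def delta_tau(R):
    # First differential parameter: tau'^2 / A^2.
    return (tau_prime(R) / A(R)) ** 2

def delta2_tau(R):
    # Second differential parameter: (1/(A R^2)) d/dR (R^2 tau' / A).
    return d(lambda r: r * r * tau_prime(r) / A(r), R) / (A(R) * R * R)

def omega(R):
    # (58'): -(1/(A R)) d/dR (1/A).
    return -d(lambda r: 1.0 / A(r), R) / (A(R) * R)

def omega1_from_60(R):
    # M = 2*Delta_2 tau + Delta tau, together with M = omega_1 + 2*omega.
    return 2 * delta2_tau(R) + delta_tau(R) - 2 * omega(R)

def omega1_closed(R):
    # Closed form (61).
    return (1.0 - 1.0 / A(R) ** 2) / R ** 2
```

Agreement of `omega1_from_60` and `omega1_closed` at a few radii is a quick test that the signs in (58'), (60) and (61) are mutually consistent.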


13. The gravitational equations in the case of spherical symmetry. Schwarzschild's rigorous solution.

We shall now apply the equations of the Einsteinian statics to the particular case of a single attracting mass, or more generally of a distribution of masses having spherical symmetry round a point $O$. Using the terminology of § 11 we shall deal with matter distributed in accordance with any law dependent only on $R$ in layers bounded by geodesic spheres of centre $O$. The Einsteinian $ds^2$ will have the statical form

$$ds^2 = V^2\,dx_0^2 - dl^2,$$

where $dl^2$ will necessarily be of the type (49'), and $V$, from symmetry, will also depend only on $R$. We shall agree to consider only regions outside the field occupied by the attracting masses. In these regions the statical equations (21'), (22) of § 4 for empty space will hold, i.e.

$$M = 0, \tag{21'}$$

$$\alpha_{ik} + \frac{V_{ik}}{V} = 0. \tag{22}$$

Since $M$ denotes the mean curvature of the symmetrical $V_3$, i.e. the sum $\omega_1 + 2\omega$ of the three principal curvatures, (21'), together with (58) and (61) of the preceding section, gives

$$\frac{1}{R^2}\left(1 - \frac{1}{A^2}\right) - \frac{1}{R}\,\frac{d}{dR}\left(\frac{1}{A^2}\right) = 0;$$

whence on separating variables and integrating

$$\frac{1}{A^2} = 1 - \frac{\alpha}{R}, \tag{62}$$

where $\alpha$ denotes a constant of integration.

It is to be noted that whatever the constant $\alpha$ may be the expression found satisfies the physically necessary condition that at an infinite distance from the attracting masses the metric tends towards the Euclidean form. In fact, if $R \to \infty$, $A \to 1$, so that the $dl^2$ (49') becomes the ordinary Euclidean expression in polar co-ordinates.

The symbols $\alpha_{ik}$ are then completely defined by the formulae (52) and (53), where $\omega$ and $\omega_1$ have the values (58) and (61).

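In order to put the gravitational equations (22) in an explicit form, we must again replace the covariant derivatives $V_{ik}$ by the ordinary derivatives. This can also be done without any preliminary calculations, as follows. Let the $x_i$'s denote generic co-ordinates in a space with a generic metric. Take a function $V$ of the $x$'s, and consider its variation along a geodesic line along which the $x$'s are considered as functions of a parameter $t$. We shall have in the first place

$$\frac{dV}{dt} = \sum_i \frac{\partial V}{\partial x_i}\,\dot{x}_i.$$

Differentiating again, and substituting for the $\ddot{x}_i$'s their values as given by the equations of the geodesics, we get, as a particular case of the notion of covariant derivatives (Chapter VI, §§ 1 and 2, p. 144),

$$\frac{d^2V}{dt^2} = \sum_{i,k} V_{ik}\,\dot{x}_i\,\dot{x}_k. \tag{63}$$

Further, assuming in particular $x_1 = R$, $x_2 = \theta$, $x_3 = \phi$, and remembering that our $V$ is a function of $R$ only, we have

$$\frac{dV}{dt} = V'\dot{R}$$

(dashes denoting derivatives with respect to $R$), and

$$\frac{d^2V}{dt^2} = V''\dot{R}^2 + V'\ddot{R}.$$

But for our metric, i.e. for

$$T = \tfrac{1}{2}\left[A^2\dot{R}^2 + R^2\left(\dot{\theta}^2 + \sin^2\theta\,\dot{\phi}^2\right)\right],$$

the equation of the geodesics for the co-ordinate $R$ gives

$$\frac{d}{dt}\frac{\partial T}{\partial \dot{R}} - \frac{\partial T}{\partial R} = 0,$$

or

$$A^2\ddot{R} + \frac{1}{2}\,\frac{dA^2}{dR}\,\dot{R}^2 - R\left(\dot{\theta}^2 + \sin^2\theta\,\dot{\phi}^2\right) = 0,$$

whence we get

$$\ddot{R} = -\frac{A'}{A}\,\dot{R}^2 + \frac{R}{A^2}\left(\dot{\theta}^2 + \sin^2\theta\,\dot{\phi}^2\right).$$

Using this result the foregoing expression for $d^2V/dt^2$ becomes

$$\frac{d^2V}{dt^2} = \left(V'' - \frac{V'A'}{A}\right)\dot{R}^2 + \frac{RV'}{A^2}\left(\dot{\theta}^2 + \sin^2\theta\,\dot{\phi}^2\right).$$

This expression, like (63), must hold along a generic geodesic, i.e. for arbitrary values of the quantities $\dot{x}_1 = \dot{R}$, $\dot{x}_2 = \dot{\theta}$, $\dot{x}_3 = \dot{\phi}$. Comparing them we get

$$V_{11} = V'' - \frac{V'A'}{A}, \qquad V_{22} = \frac{RV'}{A^2}, \qquad V_{33} = V_{22}\sin^2\theta, \qquad V_{ik} = 0 \quad (i \neq k).$$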


(64)

Substituting in the gravitational equations (22) these values for the $V_{ik}$ and the values (52) and (53) for the $\alpha_{ik}$, we see at once that the equations with two distinct indices reduce to identities, those for the pairs of indices 11 and 22 take the form

$$A^2\omega_1 + \frac{V''}{V} - \frac{V'A'}{VA} = 0, \tag{65}$$

$$\omega + \frac{V'}{RVA^2} = 0,$$

and the remaining equation is the same as this last one. Substituting in this equation for $\omega$ its value (58), i.e. $\dfrac{A'}{RA^3}$, it becomes

$$\frac{A'}{A} + \frac{V'}{V} = 0, \tag{66}$$

or

$$AV = \text{constant}.$$


At an infinite distance from the attracting masses the Einsteinian $ds^2$ must reduce to the pseudo-Euclidean form, and therefore the coefficient $V$ (the Römerian velocity of light) must tend to 1 like $A$; hence the constant must have the value 1, and we have

$$AV = 1. \tag{66'}$$

This equation and (62) give $A$ and $V$ in finite terms, so that the required $ds^2$ is now completely determined. The equation (65) remains to be considered, but it will at once be seen that with the values (62) and (66') it reduces to an identity. In fact, substituting for $\omega_1$ its value $\dfrac{1}{R^2}\left(1 - \dfrac{1}{A^2}\right)$, for $\dfrac{A'}{A}$, by (66), the equivalent $-\dfrac{V'}{V}$, multiplying by $V^2$, and remembering once more that $AV = 1$, (65) becomes

$$\frac{1}{R^2}\left(1 - V^2\right) + VV'' + V'^2 = 0,$$

or

$$\frac{1}{R^2}\left(1 - V^2\right) + \frac{1}{2}\,\frac{d^2V^2}{dR^2} = 0.$$

On substituting for $V^2$ the value $\dfrac{1}{A^2} = 1 - \dfrac{\alpha}{R}$ we find that this equation is satisfied identically, which proves the required result.
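The three conditions satisfied by the solution, namely vanishing mean curvature, equation (66), and the reduced form of (65), lend themselves to a direct numerical check. The sketch below is illustrative only: the value chosen for $\alpha$, the helper names and the finite-difference steps are assumptions.

```python
import math

ALPHA = 1.0  # constant of integration alpha, arbitrary units (assumed value)

def V(R):
    # From (62) and (66'): V^2 = 1/A^2 = 1 - alpha/R.
    return math.sqrt(1.0 - ALPHA / R)

def A(R):
    return 1.0 / V(R)

def d(f, R, h=1e-4):
    # Central-difference first derivative.
    return (f(R + h) - f(R - h)) / (2 * h)

def d2(f, R, h=1e-3):
    # Central-difference second derivative.
    return (f(R + h) - 2 * f(R) + f(R - h)) / (h * h)

def mean_curvature(R):
    # omega_1 + 2*omega, with omega_1 from (61) and omega from (58').
    omega1 = (1.0 - 1.0 / A(R) ** 2) / R ** 2
    omega = -d(lambda r: 1.0 / A(r), R) / (A(R) * R)
    return omega1 + 2 * omega

def residual_65(R):
    # (1/R^2)(1 - V^2) + (1/2) d^2(V^2)/dR^2, the reduced form of (65).
    return (1.0 - V(R) ** 2) / R ** 2 + 0.5 * d2(lambda r: V(r) ** 2, R)

def residual_66(R):
    # A'/A + V'/V, the left-hand side of (66).
    return d(A, R) / A(R) + d(V, R) / V(R)
```

All three quantities should vanish, up to discretization error, at any radius $R > \alpha$.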

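The rigorous form of the Einsteinian $ds^2$ with spherical symmetry is therefore

$$ds^2 = \left(1 - \frac{\alpha}{R}\right)dx_0^2 - dl^2, \tag{67}$$

with

$$dl^2 = \frac{dR^2}{1 - \dfrac{\alpha}{R}} + R^2\left(d\theta^2 + \sin^2\theta\,d\phi^2\right).$$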

This expression for $ds^2$ was first given by Schwarzschild.¹ The metric contains a constant $\alpha$ which is a priori arbitrary; its value can be deduced from a consideration of the intensity of the field of force at great distances from the attracting masses. In these regions the spacelike $dl^2$ tends, as we know, to become Euclidean, $R$ becoming identical with the length of the radius vector drawn from the centre of symmetry; further, the expression

$$-\tfrac{1}{2}c^2V^2 = -\tfrac{1}{2}c^2 + \frac{c^2\alpha}{2R}$$

represents the potential of the field (cf. Chapter XI, p. 328). Comparing this with the classical Newtonian expression $\dfrac{fM}{R}$ for the potential due to a mass $M$ concentrated at the origin (or symmetrically distributed round it in any way), we see that we must put

$$\alpha = \frac{2fM}{c^2}, \tag{68}$$

where $M$ is the sum of the attracting masses.
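As an order-of-magnitude illustration of (68), the constant $\alpha$ can be evaluated for the Sun. The numerical constants below are modern C.G.S. values supplied for this sketch and are not taken from the text.

```python
F_GRAV = 6.674e-8    # Newtonian constant f, cm^3 g^-1 s^-2 (modern value, assumed)
C_LIGHT = 2.998e10   # velocity of light, cm/s (modern value, assumed)
M_SUN = 1.989e33     # mass of the Sun, g (modern value, assumed)

def schwarzschild_alpha(mass_g):
    """alpha = 2 f M / c^2, formula (68), in centimetres."""
    return 2.0 * F_GRAV * mass_g / C_LIGHT ** 2

alpha_sun_km = schwarzschild_alpha(M_SUN) / 1.0e5
```

The result is about 3 km, which is minute compared with the solar radius; this is why the Euclidean approximation is so accurate in the solar field.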

¹ Sitzungsberichte der Preuss. Akad. der Wiss., 1916, pp. 189-196.

It follows from § 11 that every $dl^2$ with spherical symmetry, and therefore in particular the Einsteinian $dl^2$ (67), can be conformally represented in a Euclidean space, the modulus of the conformal representation being $\dfrac{R}{r}$, where $r$ is defined by (50), i.e. by

$$\frac{dr}{r} = \frac{A\,dR}{R}.$$

Using the relations

$$A = \frac{1}{V}, \qquad R = \frac{\alpha}{1 - V^2},$$

we have to express the right-hand side of this equation in terms of $V$, which gives

$$\frac{dr}{r} = \frac{2\,dV}{1 - V^2},$$

whence on integrating

$$r = r_0\,\frac{1 + V}{1 - V} = r_0\,\frac{(1 + V)^2}{1 - V^2} = \frac{r_0}{\alpha}\,R\,(1 + V)^2, \tag{69}$$

where $r_0$ denotes a constant.

If we wish to impose the natural condition that $r$, like $R$, shall tend to become identical with the ordinary radius vector at an indefinitely great distance from the attracting masses, we shall have to determine $r_0$ in such a way that

$$\lim_{R \to \infty}\frac{R}{r} = 1.$$

Since when $R \to \infty$, $V \to 1$, this gives

$$r_0 = \frac{\alpha}{4}.$$

Consequently

$$H = \frac{R}{r} = \frac{4}{(1 + V)^2},$$

and therefore

$$dl^2 = \frac{16}{(1 + V)^4}\,dl_0^2.$$
As an instructive example, we shall apply these rigorous formulae to calculate over again, for a symmetrical field, the expression (33) of § 6, viz.

$$ds^2 = (1 - 2\gamma)\,dx_0^2 - (1 + 2\gamma)\,dl_0^2$$

(where $\gamma$ stands for $U/c^2$), which to a first approximation gives the Einsteinian $ds^2$ corresponding to an assigned Newtonian field. In our case comparison of the coefficients $V^2$ and $1 - 2\gamma$ of $dx_0^2$ gives rigorously

$$\gamma = \frac{\alpha}{2R},$$

so that, from the value (68) of $\alpha$, $U$ is precisely the expression for the Newtonian potential of a mass $M$ symmetrically distributed round the centre. Comparison of the coefficients of $dl_0^2$ imposes the condition (at least to a first approximation)

$$1 + 2\gamma = \frac{16}{(1 + V)^4}.$$

From the expression $1 - 2\gamma$ for $V^2$ we have to a first approximation $V = 1 - \gamma$, and therefore

$$(1 + V)^{-4} = (2 - \gamma)^{-4} = \tfrac{1}{16}\,(1 + 2\gamma),$$

which ensures that the above condition is effectively satisfied so long as we neglect terms of higher order.
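The size of the neglected terms can be made explicit by comparing the rigorous coefficient $16/(1+V)^4$ with its first approximation $1 + 2\gamma$. The sketch below is an illustration with assumed function names; expanding in powers of $\gamma$ suggests that the gap should be of the second order in $\gamma$.

```python
import math

def isotropic_factor(gamma):
    """Rigorous coefficient 16/(1+V)^4 of dl_0^2, with V^2 = 1 - 2*gamma."""
    V = math.sqrt(1.0 - 2.0 * gamma)
    return 16.0 / (1.0 + V) ** 4

def first_order(gamma):
    """First approximation 1 + 2*gamma, as in formula (33) of section 6."""
    return 1.0 + 2.0 * gamma

def relative_gap(gamma):
    return abs(isotropic_factor(gamma) - first_order(gamma)) / first_order(gamma)

# Reducing gamma tenfold should reduce the gap roughly a hundredfold.
gap_ratio = relative_gap(1e-2) / relative_gap(1e-3)
```

For the Sun at its surface $\gamma$ is of the order of $10^{-6}$, so the discrepancy is far below anything observable there.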


14. Spatially uniform metrics; their cosmological interest.

We shall now examine whether there exist solutions of the gravitational equations in statical conditions, and on the hypothesis that the spacelike $dl^2$ has a constant curvature $K$ and that the energy tensor is also uniform, meaning by this that it is of the type (66) of p. 358 (applied to the statical case). This is equivalent to assuming for the $T_{ik}$'s the expressions

$$T_{00} = \eta V^2, \tag{70}$$

$$T_{ik} = p\,a_{ik} \qquad (i, k = 1, 2, 3), \tag{71}$$

where the $a_{ik}$ obviously denote the coefficients of $dl^2$. The two quantities $\eta$ ($> 0$) and $p$ represent respectively (cf. Chapter XI, p. 358) the energy density and the pressure (or pull, if $p < 0$) in the medium.

We next take into account the geometrical hypothesis that the spacelike manifold has constant curvature $K$. When the three principal curvatures $\omega_1$, $\omega_2$, $\omega_3$ all reduce to $K$, the canonical expressions for Ricci's symbols $\alpha_{ik}$ (formula (47) on p. 207), together with (46) on p. 206, give immediately

$$\alpha_{ik} = K\,a_{ik}, \tag{72}$$

while by the definition of the mean curvature we have

$$M = 3K. \tag{72'}$$

Using these results, the first of the gravitational equations, (18) of § 4, becomes

$$3K = \kappa\eta. \tag{73}$$

We deduce from this that $K \geq 0$, which comes within the general observation of § 4 that in statical conditions the mean curvature is always either positive or zero. The equation (73) then shows that $\eta$ is necessarily constant when $K$ is, or in other words that the medium must have a uniform distribution of energy, or, what is the same thing, of matter.

On account of this circumstance, this type of solution has a particular cosmological interest. It is true that the celestial bodies are separated by distances which are large compared with their dimensions, and therefore the distribution of matter in space is essentially discontinuous; but from a statistical point of view it is natural to ask what are, so to speak, the mean mechanical conditions of the universe; i.e. what would be the nature of the space-time metric on the hypothesis that the whole of the cosmic matter, instead of being concentrated in discrete masses, is uniformly distributed throughout all space, with the mean density $\eta/c^2$ of the actual distribution.

It is important to note that, as we are dealing with a space of constant positive curvature, its extension $S$ (in the sense of Chapter VI, p. 160) is finite, as we shall show in a moment. Associating it in the meanwhile with the foregoing cosmological consideration, we reach the conclusion that in this type of solution the total quantity $M$ of matter is finite, and is given by

$$M = \frac{\eta}{c^2}\,S. \tag{74}$$

In order to find the extension $S$, we take $dl^2$ in the canonical form (31) on p. 240, viz.

$$dl^2 = \frac{dy_1^2 + dy_2^2 + dy_3^2}{u^2}, \tag{75}$$

where

$$u = 1 + \frac{K}{4}\,r^2 \qquad\text{and}\qquad r^2 = \sum_{i=1}^{3} y_i^2. \tag{76}$$

We have in the first place, for the element of volume corresponding to the Euclidean $dl_0^2$ referred to polar co-ordinates $r$, $\theta$, $\phi$,

$$dS_0 = r^2\,dr\,\sin\theta\,d\theta\,d\phi,$$

and therefore, for the corresponding element of physical space,

$$dS = \frac{dS_0}{u^3}.$$

The total volume is in consequence given by

$$S = \int\frac{dS_0}{u^3},$$

the integral being extended to the whole of space. The integration with respect to $\theta$ and $\phi$ gives $4\pi$, so that we can write

$$S = 4\pi\int_0^\infty\frac{r^2\,dr}{u^3}.$$

Here we can introduce the radius $a$ of the sphere of Gaussian curvature $K$, putting $K = \dfrac{1}{a^2}$, and substitute $x = \dfrac{r}{2a}$ for $r$ as the variable of integration. This gives

$$S = 32\pi a^3\int_0^\infty\frac{x^2\,dx}{(1 + x^2)^3} = 2\pi^2a^3,$$

and therefore, from (74),

$$M = 2\pi^2a^3\,\frac{\eta}{c^2}.$$

In the given conditions, physical space has thus the volume $2\pi^2a^3$, and is therefore finite, though at the same time unlimited. This latter property holds, as for ordinary two-dimensional spherical surfaces, for any manifold of constant positive curvature in any number of dimensions.
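The value $2\pi^2a^3$ can be confirmed by numerical quadrature. The sketch below is illustrative: the substitution $x = \tan t$ and the Simpson routine are choices made here, not steps of the original argument. Under that substitution the integrand $x^2\,dx/(1+x^2)^3$ becomes $\sin^2 t\cos^2 t\,dt$ on $0 \le t \le \pi/2$.

```python
import math

def simpson(f, lo, hi, n=2000):
    # Composite Simpson rule with an even number n of sub-intervals.
    h = (hi - lo) / n
    s = f(lo) + f(hi)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * f(lo + i * h)
    return s * h / 3.0

def volume(a):
    """S = 32 pi a^3 * integral_0^inf x^2 dx / (1 + x^2)^3, computed with x = tan t."""
    integral = simpson(lambda t: (math.sin(t) * math.cos(t)) ** 2, 0.0, math.pi / 2)
    return 32.0 * math.pi * a ** 3 * integral
```

The integral evaluates to $\pi/16$, which is where the factor $2\pi^2$ comes from.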

Another general property which calls for mention is that in a variety of the kind specified the geodesics are all closed lines, of length $2\pi a$. Consider specifically the case of three dimensions which corresponds to the physical space of the problem under discussion. It will be seen immediately that without loss of generality we can always refer $dl^2$ to polar co-ordinates $r$, $\theta$, $\phi$ in such a way that for a geodesic assigned in any manner $\dot{\phi} = 0$ at one point of it; from the Lagrangian equation relative to the parameter $\phi$ it then follows, as in § 11, that $\dot{\phi} = 0$ all along the curve, which is therefore a geodesic of one of the surfaces $\phi$ = constant, or in particular, by suitable choice of the $\phi$-axis, of $\phi = 0$. In view of the transformation formulae between Cartesian and polar co-ordinates,

$$y_1 = r\sin\theta\cos\phi,$$

$$y_2 = r\sin\theta\sin\phi,$$

$$y_3 = r\cos\theta,$$

this is equivalent to saying that any geodesic can always be considered as belonging to the co-ordinate plane $y_2 = 0$; but, for $y_2 = 0$, $dl^2$ assumes the canonical form of a two-dimensional manifold of constant curvature $K$, i.e. of the ordinary sphere of radius $a$. The geodesic therefore coincides with a great circle on this sphere, and is therefore a closed curve of length $2\pi a$.

We now pass on to the other six gravitational equations.

Taking account of (71) and (72), the equations (17) become

$$\frac{V_{ik}}{V} + \left(K + \kappa p - \frac{\Delta_2V}{V}\right)a_{ik} = 0 \qquad (i, k = 1, 2, 3), \tag{77}$$

which can be satisfied in two different ways, according as we suppose $V$ constant (Einstein's cylindrical space-time) or $V$ a function of position (De Sitter's hyperspherical space-time).¹

15. Einstein's solution.

First, suppose $V$ constant. In this case it is necessary and sufficient to add to (73) the condition

$$K + \kappa p = 0. \tag{78}$$

From this it follows first of all that the normal stress $p$ is necessarily the same at every point, and on comparison with (73) there follows

$$p = -\tfrac{1}{3}\,\eta, \tag{73'}$$

whence we get the following result:

In a homogeneous medium subjected to a uniform pull of $\tfrac{1}{3}\eta$, $\eta$ being the energy density, the space assumes the constant positive curvature $K = \dfrac{\kappa}{3}\,\eta$, the velocity $V$ of light remaining constant (and being naturally supposed not zero).

Remembering that in statical conditions the potential of the force in the field is $-\tfrac{1}{2}V^2$ (Chapter XI, p. 328), we see at once that in the present case the force is zero.


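16. De Sitter's solution.

Now suppose that $V$ is a function of position. Multiplying (77) by $a^{ik}$ and summing with respect to $i$ and $k$ we get in the first place

$$\frac{\Delta_2V}{V} + 3\left(K + \kappa p - \frac{\Delta_2V}{V}\right) = 0,$$

or

$$\frac{\Delta_2V}{V} = \tfrac{3}{2}\,(K + \kappa p).$$

The equations (77) are therefore equivalent to

$$\frac{V_{ik}}{V} + K^*a_{ik} = 0, \tag{77'}$$

where for brevity we have put

$$K^* = K - \tfrac{1}{2}\,(3K + \kappa p). \tag{79}$$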

¹ Cf. T. Levi-Civita: "Realtà fisica di alcuni spazi normali del Bianchi," in Rend. della R. Acc. dei Lincei, Vol. XXVI (first half-year, 1917), pp. 519-531.





It is easy to see that the equations (77') are mutually consistent for $V$ not constant (in fact, they constitute a complete system with respect to $V$ considered as the unknown function) if, and only if, $K^* = K$. To prove this, take the commutation formula (20) on p. 186 for the second covariant derivatives of a simple system $V_i$, which gives

$$V_{ihk} - V_{ikh} = -\sum_{r=1}^{3}\{ir, hk\}\,V_r.$$

Substituting for Riemann's symbols of the second kind the expressions for a manifold of constant curvature $K$ (formula (19') on p. 234), viz.

$$\{ir, hk\} = K\left(a_{ih}\,\delta_k^r - a_{ik}\,\delta_h^r\right),$$

we get

$$V_{ihk} - V_{ikh} = -K\left(a_{ih}V_k - a_{ik}V_h\right).$$

Further, multiplying (77') by $V$ and taking the covariant derivative, we get

$$V_{ihk} = -K^*a_{ih}V_k; \tag{77''}$$

substituting in the preceding equation, we get the conditions of integrability

$$(K^* - K)\left(a_{ih}V_k - a_{ik}V_h\right) = 0$$

for every set of values of the three indices $i$, $h$, $k$. Since by hypothesis $V$ is an effective function, one at least of its derivatives (say $V_k$) will not be zero. In the above equations take this value of $k$ and a value of $h$ different from $k$, multiply by $a^{ih}$ and sum with respect to $i$. This gives

$$(K^* - K)\,V_k = 0,$$

whence $K^* - K = 0$,

Q. E. D.

Using this result, we get from (79)

$$3K + \kappa p = 0,$$

which leads to the same qualitative statements with regard to the stresses as those made above for the cylindrical space-time.

For the integration of the equations (77'), in which from now onwards we put $K^* = K$, we must again take $dl^2$ in the canonical form (75).

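The covariant derivatives $V_{ik}$ of $V$ with respect to our $dl^2$ can be found explicitly as functions of the ordinary derivatives, without direct calculation, from the considerations in Chapter VIII, pp. 222-232. In fact, considering our $dl^2$ and the corresponding Euclidean $dl_0^2$ referred to the same co-ordinates, and denoting by $V^0_{ik}$ the covariant derivatives with respect to $dl_0^2$, we have, from formula (9) on p. 224,

$$V_{ik} = V^0_{ik} - \sum_{l=1}^{3}\rho_{ik}^{\,l}\,V_l,$$

where, by (16) on p. 230,

$$\rho_{ik}^{\,l} = \delta_i^l\,\tau_k + \delta_k^l\,\tau_i - \delta_{ik}\,\tau^l,$$

with in our case $\tau = -\log u$, $u$ having the value given in (76).

Noting that for $dl_0^2$ referred to Cartesian co-ordinates the derivatives $V^0_{ik}$ are identical with the ordinary second derivatives, and that $\tau^i = \tau_i = -\dfrac{u_i}{u}$, we have the required expressions in the form

$$V_{ik} = \frac{\partial^2V}{\partial y_i\,\partial y_k} + \frac{1}{u}\left(u_iV_k + u_kV_i\right) - \frac{\delta_{ik}}{u}\sum_{l=1}^{3}u_lV_l.$$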

Substitute these expressions in the equations (77'), which on multiplying by $uV$ take the form

$$uV_{ik} + \frac{KV}{u}\,\delta_{ik} = 0$$

for $K^* = K$ and $a_{ik} = \dfrac{\delta_{ik}}{u^2}$. Putting for brevity

$$W = uV, \tag{80}$$

and using the expression (76) for $u$, we get

$$u\,\frac{\partial^2V}{\partial y_i\,\partial y_k} + u_iV_k + u_kV_i - \delta_{ik}\left(\sum_{l=1}^{3}u_lV_l - \frac{KV}{u}\right) = 0,$$

whence it follows that

$$\frac{\partial^2W}{\partial y_i\,\partial y_k} = \delta_{ik}\left(\sum_{l=1}^{3}u_lV_l + \frac{K}{2}\,\frac{W}{u} - \frac{KW}{u^2}\right).$$

But by (80)

$$V_l = \frac{W_l}{u} - \frac{u_l}{u^2}\,W;$$

substituting, and taking into account the definition (76) of $u$ and the consequent identity

$$\sum_{l=1}^{3}u_l^2 = K\,(u - 1),$$

the foregoing equations become

$$\frac{\partial^2W}{\partial y_i\,\partial y_k} = \frac{\delta_{ik}}{u}\left(\sum_{l=1}^{3}u_lW_l - \frac{K}{2}\,W\right).$$

From this it follows at once that for $i \neq k$ the second derivatives of $W$ vanish, so that $W$ must be a function with the variables separated (the sum of three functions, one of $y_1$ alone, one of $y_2$ alone, and one of $y_3$ alone). Further, for $i = k$, the equations above show that as the terms on the right are the same for all three cases, we must have also

$$\frac{\partial^2W}{\partial y_1^2}, \qquad \frac{\partial^2W}{\partial y_2^2}, \qquad \frac{\partial^2W}{\partial y_3^2}$$

all equal; their common value must therefore be a constant, which we can denote by $\tfrac{1}{2}Kb_0$. Hence the most general expression for $W$ is of the type


$$W = \frac{K}{4}\,b_0\,r^2 + w + C,$$

where $w$ is a linear homogeneous function, a priori undetermined, and $C$ is a constant. The coefficients of this expression are to be so determined that

$$\frac{1}{u}\left(\sum_{l=1}^{3}u_lW_l - \frac{K}{2}\,W\right) = \frac{K}{2}\,b_0,$$

i.e. that

$$\sum_{l=1}^{3}u_lW_l - \frac{K}{2}\,W = \frac{K}{2}\,b_0\,u.$$

By Euler's theorem on homogeneous functions the linear term $w$ contributes nothing to the left-hand side, so that its three coefficients are arbitrary; there thus remains

$$\frac{K}{4}\,b_0\,r^2 - C = b_0\,u,$$

which by (76) reduces to

$$C = -b_0.$$

Hence the final expression for $W$ can be written in the form

$$W = b_0\,(u - 2) + w, \tag{81}$$

the constant $b_0$ and the coefficients of $w$ being still completely arbitrary. This number of constants could of course be predicted from the fact that the system (77') is completely integrable; as all the second derivatives of the function $V$ are defined by it, it is obviously equivalent (cf. Chapter II, p. 13) to a total differential system in four unknown functions, viz. $V$ itself and its three first derivatives.
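That $V = W/u$, with $W$ given by (81), does satisfy the equations (77') for $K^* = K$ can be tested numerically. The sketch below is illustrative only: the sample values of $K$, $b_0$ and the coefficients of $w$, the test point and the finite-difference steps are all assumptions. It evaluates the left-hand side of $u\,\partial^2V/\partial y_i\partial y_k + u_iV_k + u_kV_i - \delta_{ik}\left(\sum_l u_lV_l - KV/u\right) = 0$ for every pair of indices.

```python
K = 0.7                  # sample curvature (assumed value)
B0 = -1.3                # sample constant b_0 (assumed value)
B = (0.4, -0.2, 0.9)     # sample coefficients b_1, b_2, b_3 of w (assumed values)

def u(y):
    # u = 1 + (K/4) r^2, formula (76).
    return 1.0 + 0.25 * K * sum(c * c for c in y)

def V(y):
    # V = W/u with W = b_0 (u - 2) + w and w = 2 (b_1 y_1 + b_2 y_2 + b_3 y_3).
    w = 2.0 * sum(b * c for b, c in zip(B, y))
    return (B0 * (u(y) - 2.0) + w) / u(y)

def shift(y, i, h):
    z = list(y)
    z[i] += h
    return z

def d1(f, y, i, h=1e-4):
    # Central-difference partial derivative with respect to y_i.
    return (f(shift(y, i, h)) - f(shift(y, i, -h))) / (2 * h)

def d2(f, y, i, k, h=1e-3):
    # Second partial derivative as a nested central difference.
    return d1(lambda z: d1(f, z, k, h), y, i, h)

def residual(y, i, k):
    """u V_,ik + u_i V_k + u_k V_i - delta_ik (sum_l u_l V_l - K V / u)."""
    ui = [0.5 * K * c for c in y]            # u_l = (K/2) y_l
    Vi = [d1(V, y, l) for l in range(3)]
    delta = 1.0 if i == k else 0.0
    trace = sum(a * b for a, b in zip(ui, Vi))
    return (u(y) * d2(V, y, i, k) + ui[i] * Vi[k] + ui[k] * Vi[i]
            - delta * (trace - K * V(y) / u(y)))

POINT = [0.3, -0.5, 0.8]
worst = max(abs(residual(POINT, i, k)) for i in range(3) for k in range(3))
```

The residual vanishes (to discretization error) for arbitrary $b_0$, $b_1$, $b_2$, $b_3$, in agreement with the count of four arbitrary constants.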

It is also to be noted that the three constants of integration which appear in the linear expression

$$w = 2\,(b_1y_1 + b_2y_2 + b_3y_3)$$

can obviously be reduced to one, since by a suitable orthogonal transformation applied to the $y_i$'s (for which $r^2$, $u$, and $dl^2$ are all invariant), we can always reduce the trinomial to the form $2by_1$, with $b = \sqrt{b_1^2 + b_2^2 + b_3^2}$.

But we may also suppose $b = 0$; this can be formally proved (though in a less elementary way) by taking account of the homogeneity of a space with constant curvature, which enables us to take a point fixed in advance as the point $y_i = 0$, while retaining the canonical form $\dfrac{dl_0^2}{u^2}$ for $dl^2$.¹

¹ This becomes intuitive for the case of two dimensions, in which a manifold of constant positive curvature is an ordinary sphere, and the canonical expression for $dl^2$ is obtained by stereographic projection of the sphere on a diametral plane (cf. Chapter VIII, p. 241). The assertion in the text reduces in this case to the obvious geometrical fact that any point whatever of the sphere may be chosen as the centre of projection.

Using these results, it follows from (80) that the expression for the spatially uniform $ds^2$, on the hypothesis of $V$ variable, is

$$ds^2 = \frac{W^2}{u^2}\,dx_0^2 - \frac{dl_0^2}{u^2},$$

where

$$W = b_0\,(u - 2), \qquad u = 1 + \frac{K}{4}\,r^2. \tag{82}$$


It is assumed that the constant $b_0$ is not zero, as otherwise we should have identically $V = 0$, which is not permissible, since we are considering the case of $V$ variable.

In view of the physical significance of $V$, those points, if any, at which $V = 0$ obviously denote singularities in the field; they remain, so to speak, optically isolated, in a sense which will be explained further on. On the other hand, as $r$, and therefore $u$, increases indefinitely, $V$ tends to $b_0$; further, for finite values of $r$, $u$ remains essentially finite and $\geq 1$, so that the singular points are determined by the equation $W = 0$. This equation, combined with (82) and the relation $K = \dfrac{1}{a^2}$, becomes

$$r = 2a,$$

which in the representative Euclidean space defines a sphere $D_0$. The surface $D$ which corresponds to it in the physical space, and which, by § 11, is also a (geodesic) sphere, is called the horizon, because it constitutes in a certain sense the limit of the perceptible universe. This follows from the fact that light, and a fortiori a material particle, would take an infinite time to reach it. To prove this, let $A$ and $B$ be two generic points; then by the definition of $V$ the time taken by light to pass from $A$ to $B$ is

$$t = \int\frac{dl}{V} = \int\frac{dl_0}{uV} = \int\frac{dl_0}{W},$$

where the integral is taken along the ray joining $A$ to $B$. When $B$ tends to the horizon the integrand tends to an infinity of the first order at $B$, and therefore the integral cannot remain finite.
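The divergence can be made concrete for a radial path starting from the centre $r = 0$, which is taken here to be a light ray by symmetry (an assumption of this sketch). With $|W| = |b_0|\left(1 - r^2/4a^2\right)$ the integral has the closed form $\dfrac{2a}{|b_0|}\,\operatorname{artanh}\dfrac{r_B}{2a}$, which grows logarithmically without limit as $r_B \to 2a$. The function names and sample values are assumptions.

```python
import math

def light_time(r_b, a=1.0, b0=1.0):
    """Time for light to pass radially from r = 0 to r = r_b < 2a.

    Closed form of the integral of dl_0/|W| with |W| = |b0| (1 - r^2/(4 a^2)).
    """
    return 2.0 * a / abs(b0) * math.atanh(r_b / (2.0 * a))

def light_time_numeric(r_b, a=1.0, b0=1.0, n=20000):
    # Midpoint rule on the same integrand, as an independent check.
    h = r_b / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * h
        total += h / (abs(b0) * (1.0 - r * r / (4.0 * a * a)))
    return total

# Travel times to points ever closer to the horizon r = 2a (here a = 1).
times = [light_time(2.0 * (1.0 - 10.0 ** -k)) for k in (1, 3, 6, 9)]
```

Each further factor of a thousand in proximity to the horizon adds roughly the same increment to the travel time, which is the signature of a first-order infinity in the integrand.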

As we have already several times recalled (in particular in the preceding section), the force in the field is the gradient of $-\tfrac{1}{2}V^2$; in consequence it tends to displace the material masses towards the regions of minimum $V^2$, i.e. towards the horizon. This circumstance was regarded as an incongruence of De Sitter's space-time; but it is to be observed that it must be taken to refer solely to accidental masses (sufficiently small not to modify the field perceptibly), and not to those uniformly diffused masses which constitute it, the equilibrium of which is automatically assured by the gravitational equations.

It is interesting to remark that the problem of spatially uniform metrics (§ 14) admits of a solution which includes both Einstein's and De Sitter's solutions as particular cases.¹

In fact, in the argument beginning at equation (77'), it was tacitly assumed, at (77''), that $K^*$ is constant. If we drop this supposition we find

$$V_{ihk} - V_{ikh} = -K^*\left(a_{ih}V_k - a_{ik}V_h\right) - V\left(a_{ih}K^*_k - a_{ik}K^*_h\right),$$

which, on combination as before with

$$V_{ihk} - V_{ikh} = -K\left(a_{ih}V_k - a_{ik}V_h\right),$$

gives

$$(K - K^*)\left(a_{ih}V_k - a_{ik}V_h\right) = V\left(a_{ih}K^*_k - a_{ik}K^*_h\right).$$

If we put $\sigma$ for $K - K^*$, this becomes

$$a_{ih}\left(\sigma V_k + V\sigma_k\right) - a_{ik}\left(\sigma V_h + V\sigma_h\right) = 0,$$

leading, by the same treatment as in the former case, to

$$\sigma V_k + V\sigma_k = 0,$$

or

$$\sigma V = \text{constant} = A, \text{ say}.$$

Equation (77') may now be written

$$V_{ik} + (KV - A)\,a_{ik} = 0.$$

Thus, if instead of (80) we write

$$W = u\left(V - \frac{A}{K}\right),$$

the investigation proceeds exactly as before, and leads to the same value of $W$, viz. that given in equation (81). The new value of $V$ is therefore

$$V = \frac{A}{K} + \frac{W}{u} = \frac{A}{K} + \frac{b_0\,(u - 2)}{u}.$$

If $A = 0$ we have De Sitter's solution; if $b_0 = 0$ we have Einstein's. If both $A$ and $b_0$ are different from zero, the curvature $K$ is still constant, but the (normal) stress $p$ is variable, being given by

$$K - K^* = \sigma = \frac{A}{V},$$

or

$$3K + \kappa p = \frac{2A}{V}.$$

¹ This extension of the analysis was suggested to me by Dr. John Dougall.

We shall conclude this section by showing that De Sitter's space-time not only, like Einstein's, implies that physical space (i.e. any manifold $x_0$ = constant) has constant positive curvature $K$, but has itself, as a four-dimensional manifold, constant negative curvature.

To prove this, we start from a known property of every spacelike $dl^2$ which has constant curvature $K$, namely (Chapter VIII, p. 234), that Riemann's symbols for $dl^2$ have the form (19') of p. 234, or

$$\{ir, hk\} = K\left(a_{ih}\,\delta_k^r - a_{ik}\,\delta_h^r\right) \qquad (i, r, h, k = 1, 2, 3).$$

By (11) and (13) of § 4, these relations can be written in the form

$$\{ir, hk\}' = -K\left(g_{ih}\,\delta_k^r - g_{ik}\,\delta_h^r\right), \tag{83}$$

still for the same values 1, 2, 3 of the indices. Now it is easy to see that these last formulae, in virtue of the expressions (14') for Riemann's symbols for our $ds^2$ and of the equations

$$\frac{V_{ik}}{V} = -K\,a_{ik} = K\,g_{ik} \qquad (i, k = 1, 2, 3), \tag{77'''}$$

will still hold when 0 is included among the values to be assigned to the indices. This is obvious when one, three, or four indices are equal to 0, since then (§ 4) both sides of the equation vanish. In the case of two indices zero we have, as in § 4, to examine the two types $\{0r, 0k\}'$, $\{i0, 0k\}'$. The corresponding values of the left-hand side are respectively

$$V\sum_{l=1}^{3}a^{rl}\,V_{lk} \qquad\text{and}\qquad \frac{V_{ik}}{V},$$

i.e. in view of (77'''),

$$-KV^2\,\delta_k^r \qquad\text{and}\qquad K\,g_{ik}.$$

The values of the expression on the right are clearly the same. Thus the equations (83) hold for all values of the indices from 0 to 3, which is precisely equivalent (still by formula (19') of p. 234) to saying that the $ds^2$ of space-time has constant negative curvature $-K$.

It may be well to observe that while the notion of a manifold of constant curvature and the measure $K$ of this curvature are by their nature invariant, i.e. independent of the choice of the co-ordinates of reference, this invariance does not persist for multiplication of $ds^2$ by a constant factor $m$. In fact, when all the coefficients $g_{ik}$ are multiplied by $m$, Riemann's symbols of the second kind are unchanged, so that, again by formula (19') of p. 234, the curvature $K$ is divided by $m$. In particular, for $m = -1$, it changes sign. This explains the apparent contradiction between our enunciation and that of some writers who take $-ds^2$ as the fundamental form and assign constant positive curvature to De Sitter's space-time.

17. Einstein's additional term. Indication of other rigorous solutions.

For Einstein's solution we found in §§ 14 and 15 (formulae (73) and (78))

$$3K = \kappa\eta, \qquad K + \kappa p = 0.$$

We cannot therefore suppose the matter devoid of stresses ($p = 0$) without concluding that $\eta = 0$, which brings us back to the uninteresting case of a totally empty space. Now if we take the cosmologico-statistical point of view (in the sense indicated in § 14), it seems reasonable to suppose that there must be a solution of the gravitational equations corresponding to the hypothesis of a uniform distribution of matter which shall be so tenuous that the molecular actions between contiguous particles, and therefore the stresses, are imperceptible; such, that is, that $p = 0$, while $\eta$ is a constant other than zero.

Since the gravitational equations in the original form (8), viz.

$$G_{ik} - \tfrac{1}{2}\,G\,g_{ik} = -\kappa\,T_{ik},$$

have no solution of this type, Einstein was led to modify them (very slightly) by adding a term which maintains the tensorial character of the equations (8), and which in ordinary cases is completely imperceptible while serving to render possible a solution of the type indicated. This term was assumed by Einstein in the particularly simple form $\lambda g_{ik}$, $\lambda$ denoting a constant which in most cases is negligible compared with $G$. The gravitational equations so modified are

$$G_{ik} - \tfrac{1}{2}\,G\,g_{ik} + \lambda\,g_{ik} = -\kappa\,T_{ik} \qquad (i, k = 0, 1, 2, 3).$$

The statical equations accordingly become

$$M - \lambda = \kappa\eta,$$

$$\alpha_{ik} + \frac{V_{ik}}{V} - \left(\frac{\Delta_2V}{V} + \lambda\right)a_{ik} = -\kappa\,T_{ik} \qquad (i, k = 1, 2, 3).$$

Proceeding as in §§ 14, 15, on the hypothesis that the space- 
lihe dp has constant curvature, that the density is constant, and 
that the stresses are isotrojuc (i.e. are given by (70) and (71)), 
we ultimately reach the two equations 

— KY] X, 

K + Kp A, 

between K, t], p, and A, which take the place of (73) and (78). 

Here it plainly becomes possible to put $p = 0$ without $\eta$ 
necessarily having to vanish at the same time; we need only 
take 

$$\eta = \frac{2K}{\kappa}, \qquad K = \lambda.$$
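
The elimination behind this choice can be written out step by step. The sketch below assumes the two modified relations in the form $3K - \kappa\eta = \lambda$, $K + \kappa p = \lambda$; the combined relation $2K = \kappa(\eta + p)$ is not stated in the text and is obtained here simply by subtracting one from the other. The `gather*` environment assumes the `amsmath` package.

```latex
% Worked check (a sketch, not part of the original text).
\begin{gather*}
% The two relations that replace (73) and (78):
3K - \kappa\eta = \lambda, \qquad K + \kappa p = \lambda \\
% Subtracting the second from the first removes \lambda:
2K = \kappa(\eta + p) \\
% Matter without stresses: the second relation fixes K, the combined one fixes \eta:
p = 0: \quad K = \lambda, \qquad \eta = \frac{2K}{\kappa} \\
% Without the additional term the earlier relations are recovered:
\lambda = 0: \quad 3K = \kappa\eta, \qquad K + \kappa p = 0
\end{gather*}
```

With $\lambda = 0$ and $p = 0$ the second relation forces $K = 0$ and hence $\eta = 0$, which is the empty-space case excluded at the beginning of the section.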
To get an idea of the order of smallness of the constant $\lambda$, 
we may note that the mean cosmic density $\eta/c^2$ of matter can 
certainly be regarded as considerably less than that of the nebulae, 
which is of the order of $10^{-17}$ gm./cm.³. It is therefore legitimate 
to assume that in any case 

$$\frac{\eta}{c^2} < 10^{-17}.$$

From the numerical values (in C.G.S. units) $\kappa = 2 \times 10^{-48}$, 
$c = 3 \times 10^{10}$, we have 

$$\lambda = K < 9 \times 10^{-45}.$$

For the radius $a$ of the universe ($K = 1/a^2$) we thus get a lower 
limit given by 

$$a > 10^{22} \text{ cm}.$$

This radius is therefore certainly considerably greater than 
$10^{17}$ km. or 10,000 light-years. 
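
The arithmetic of this estimate can be set out explicitly. The sketch below assumes the values $\kappa = 2 \times 10^{-48}$, $c = 3 \times 10^{10}$ and the bound $\eta/c^2 < 10^{-17}$ gm./cm.³, together with the relations $\lambda = K = \tfrac{1}{2}\kappa\eta$ and $K = 1/a^2$ for the case $p = 0$. The length of the light-year, about $9.46 \times 10^{17}$ cm, is a standard figure and is not taken from the text. The `align*` environment assumes the `amsmath` package.

```latex
% Worked arithmetic (a sketch under the assumptions stated above).
\begin{align*}
% Upper bound for \lambda = K from the density bound:
\lambda = K = \tfrac{1}{2}\,\kappa\eta
  &= \tfrac{1}{2}\,\kappa c^{2}\,\frac{\eta}{c^{2}} \\
  &< \tfrac{1}{2}\,(2\times10^{-48})\,(9\times10^{20})\,(10^{-17})
   = 9\times10^{-45}\ \mathrm{cm}^{-2}, \\
% Lower bound for the radius a = K^{-1/2}:
a = K^{-1/2}
  &> (9\times10^{-45})^{-1/2}\ \mathrm{cm}
   = \tfrac{1}{3}\times10^{22.5}\ \mathrm{cm}
   \approx 1.05\times10^{22}\ \mathrm{cm}, \\
% Conversion to kilometres and light-years:
10^{22}\ \mathrm{cm}
  &= 10^{17}\ \mathrm{km}
   \approx \frac{10^{22}}{9.46\times10^{17}}\ \text{light-years}
   \approx 1.06\times10^{4}\ \text{light-years}.
\end{align*}
```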


We shall conclude with some bibliographical references con- 
cerning the rigorous solutions of the gravitational equations (with 
or without the cosmological term) in some special cases. 

Schwarzschild’s solution is supplemented or generalized in 
various important respects by the original contributions of 
Birkhoff, De Donder, Eddington, v. Laue,¹ and Weyl, which 
are given in their respective treatises, and of Signorina Longo,² 
Trefftz,³ Nuyens,⁴ and Vanderlinden.⁵ 

A different type of solution is considered in the researches of 
Weyl,⁶ Levi-Civita,⁷ Bach,⁸ Chazy,⁹ Palatini,¹⁰ and Kasner.¹¹ 

¹ Cf. also Sitzungsberichte der Preuss. Ak. der W., 1923, pp. 27-31. 

² Nuovo Cimento, Vol. XV, 1918, pp. 191-211. 

³ Math. Annalen, Vol. 86, 1922, pp. 317-326. 

⁴ Comptes Rendus, Vol. 176, 1923, pp. 1370-1379. 

⁵ Bull. de l’Ac. royale de Belgique, 1921, pp. 260-276. 

⁶ Annalen der Physik, 54 (1918), pp. 117-145; 59 (1919), pp. 185-188. 

⁷ “ds² einsteiniani in campi newtoniani”, Notes I-IX, in Rend. della R. Acc. 
dei Lincei, Vols. XXVI, XXVII, XXVIII, 1917-1919. 

⁸ Mathematische Zeitschrift, Vol. 13, 1922, pp. 134-145. 

⁹ Bulletin de la Société Math. de France, Vol. LII, 1924, pp. 17-37. 

¹⁰ Nuovo Cimento, Vol. XXVI, 1923, pp. 5-24. 

¹¹ Trans. of the American Math. Society, Vol. 27, 1925, pp. 101-105, 155-162. 




ADDITIONAL NOTES 


P. 122, line 9 from foot. See a short but substantial article by 
E. Cartan, who discusses the question exhaustively from the 
geometrical point of view; Annales de la Société Polonaise de Mathématiques, 
Vol. VI (1927), pp. 1-7. An earlier paper by M. Janet, 
ibidem, Vol. V (1926), pp. 38-73, may also be consulted. 

P. 168, at the end. A luminous demonstration of M. Fermi’s 
theorem, as simple as it is intimately related to fundamental principles, 
has been given recently by Mlle. P. Nalli: Rend. Acc. Lincei, 
Vol. VII (1928), pp. 195-198. 

P. 171, at the end. The use of locally geodesic co-ordinates enables 
us to recognize at once an important property of the ε-systems, which 
they possess in common with the fundamental tensors $a_{ik}$, $a^{ik}$: 
their covariant derivative vanishes identically. For each element of 
an ε-system is either zero or of the form $\pm\sqrt{a}$, $\pm 1/\sqrt{a}$. The 
derivatives of the $a_{ik}$’s being zero (in geodesic co-ordinates, for the 
point considered), the same is true for every element of an ε-tensor. 
It follows (p. 71, final paragraph) that the covariant derivative 
vanishes in any system of co-ordinates whatever. 

P. 188, line 3. The general case in which the cycle T and consequently 
the area F are not restricted to be infinitely small, can 
also be treated without great difficulty, as has been shown very 
ingeniously by J. M. McConnell, Rend. Acc. Lincei, Vol. VII (1928), 
pp. 208-213, 306-309. 

P. 209, end of footnote. See also J. L. Synge, On the Geometry of 
Dynamics, Phil. Trans. Roy. Soc., A, 226 (1926), pp. 31-106; and 
various notes by MM. Berwald, Boggio, Cartan, Crudeli, 
De Mira Fernandes, Onicescu, Vranceanu, Rend. Acc. Lincei, 
Vols. V, VI and VII (1927, 1928). 

P. 228, after formula (14). Formulas (13) and (14) can be proved 
more readily, without any formal development, by means of geodesic 
co-ordinates, as has been remarked by Mlle. Nalli. See her 
note Due dimostrazioni nel calcolo assoluto, Boll. dell’ Unione Mat. 
Italiana, Vol. VII (1928), pp. 124, 127. 

P. 234, line 10 from foot. A simpler proof, due to Mlle. Nalli, 
is given in the paper cited in the note to p. 228. 

P. 439, at end of references. On all these questions, Darmois, 
Les équations de la gravitation einsteinienne, Fasc. XX, Mémorial 
des Sciences Mathématiques (Paris, Gauthier-Villars, 1927) may 
also be consulted. 
Action, in mechanics, 331. 

Action, stationary, 324, 331. 

Addition of tensors, 75. 

Angle between two directions, 92. 

in Vay 123-126. 

Angular metric, 123. 

Antisymmetrical systems, 66. 

Area, element of, 99. 

Associated tensors, 95. 

with respect to quadric, 96. 

— vectors, 140. 

Atom, vibration of, 400. 

Attracting mass, spherical, 419-423. 
Attraction on planet, 369. 
Autoparallelism of geodesic, 104, 140. 

Bending of light rays, 402, 403-408. 
Bianchi's derived vectors, 137-140. 
Bibliography. See Preface. 

— of solutions of Schwarzschild type, 
439. 

Bilinear covariant, 18, 20, 21. 

— form, 66, 70. 

Binary form, curvature of, 414-419. 

Canonical form of quadratic, 206, 281. 

— system as to given congruence, 278. 
Cartesian co-ordinates, 61. 

— co-ordinates locally, 164, 167, 171, 
202. 

Central forces, 397. 

Change of co-ordinates, 61. 

— variables, 2, 61. 

general, 80. 

Characteristic of envelope, 101. 

matrix, 9, 39, 87, 250. 

surfaces, 47. 

Christoffel’s symbols, derivatives of 
in terms of, 111. 

determinant a in terms of, 112. 

of first kind, 109, 111. 

of second kind, 110, 111. 

vanishing of, 121. 

Classical mechanics, correction to, 392. 


meaning of constant, 335. 

Coefficient second approximation to, 
392-394- 

Coefficients of by experiment, 363- 
368. 

qualitative properties of, 369. 

quadratic form, covariance of, 73, 

Commutation of second derivatives, 

273. 

rule, 184. 

Complete system of partial differential 
equations, 52, 53. 

— total differential equations, 15-18. 

Composition of tensors, 79. 

velocities, in relativity, 317. 

Compounded tensor, 79. 

Conformal representation, 229, 246. 

— — of Einsteinian in Euclidean space, 

423- 

of Ka, 41 1. 

Congruence, canonical system as to, 278. 

— geodesic, 262, 274. 

-- curvature of, 275. 
normal, 263, 275, 285. 
of curves, 46, 47. 
solenoidal geodesic, 363. 

Congruences in Euclidean space, 282. 

~ set of normal, 277. 

- ' sets of, 265. 

Constant, gravitational, 386. 

— K, in Einstein's equations, 387. 
Continuity, equation of, 347, 349"35^» 

3f>o, 363- 

molecular, 361. 

Continuous system, mechanics of, 347, 
352 . 

— — in covariant equations, 348, 349. 

— — in terms of energy tensor, 351. 

— — relativistic equations for, 359. 
Contraction of bodies in motion, 313, 

— of tensors, 77-79. 

Contravariance of the n** ’s, 92. 
Contravariance, m-fold system, 69*71. 
Contravariance, simple system, 67, 81. 

— transformation by, 67, 69-71. 
Contravariant differentiation, 149. 
Co-ordinate hypersurfaces, angle be- 
tween, 128. 

— lines, angle between, 128. 

parameters of, 98. 

moments of, 98. 

Co-ordinates, Cartesian, 61. 

— curvilinear, 61, 87. 

— in space-time, 290. 

— locally Cartesian, 164-171, 202. 

* geodesic, 164, 167, 171. 

— Plücker’s, 68. 

Cosmological interest, solutions of, 426. 

— term, Einstein’s, 438, 439. 
Covariance, m-fold system, 69—71. 

— simple system, 67, 68, 82. 

— transformation by, 64, 67-71. 
Covariant, bilinear, 18, 20, 21. 

— derivative, 146. 

second, 184. 

— differentiation, 144, 149. 

— - — of invariant, 147. 

of vector, 147. 

— simple system, typical, 82. 

systems, sets of, 74. 

Cramer’s rule, 54, 55. 

Curl of vector, 161. 

Curvature, calculated, 413. 

— constant, canonical forms for mani- 
folds of, 238, 240. 

manifolds of, 236, 246. 

— Gaussian, 172. 

- — of ordinary surface, 193. 

— geodesic, 135-137. 

of congruence, 275. 

— hyperspherical, 258. 

formula for, 260. 

— lines of, 286. 

— locally constant, 235. 

— mean, of physical space, 382. 

of 258. 

of Ka, 372. 

— of a Ka, 201. 

calculated, 413. 

formula for, 203. 

— of space, in Einstein’s space-time, 
429, 43*^- 

— principal, in Ka, 204. 

' — Riemannian, of Vn, 195. 

— space of constant, 425-439. 
Curvatures, calculation of, 414-419. 
Curvilinear co-ordinates, 61, 87. 
Cylindrical space-time, Einstein’s, 429. 

Density, mean cosmic, of matter, 426, 

439. 

— of energy, 350, 351. 

momentum, 350. 


Density of nebulae, 439. 

Derivative, covariant, 146. 

— of vector, 139, 140. 

Derivatives, transformation of, 85. 

— of determinant a, 112- 

De Sitter’s space-time, 429-435. 

— constant negative curvature of, 

436. 

Determinant, functional, 2, 4-12. 

— <2, derivatives of, in terms of Chris- 
toffel’s symbols, 112. 

— reciprocal elements in, 54, 55, 80, 81. 
Developable, circumscribed, 101. 

— surfaces, 100, 101. 

Differential equations, linear partial, 
33--61. 

normal form of, 36. 

-- total, 13-33* 

— parameter, first, 231. 

— parameter, second, 154, 393. 

— parameters, in special case, 418. 

-- total, 13, 64. 

Differentials, linear transformation of, 
80. 

Differentiation, covariant, 144, 149. 
Direction. See Versor. 

— spacelike, 356, 357. 

— timelike, 356, 357. 

Directions of co-ordinate lines, in K//, 

127. 

surfaces, in 127. 

specification of, 90. 

Discrete system, motion of, 360-363. 

incompressible, 361. 

Discriminant of u^i’s, 157. 
Displacement, cyclic, 173, 186. 

— infinitesimal, 104. 

— parallel, 103. See Parallel displace- 
ment. 

— spectral, 400. 

ds² and gravitation, 375. 

— approximately pseudo-Euclidean, 
383. 

— coefficients of, and gravitational 
experiments, 367. 

— — by experiment, 363-368. 

— covariance of coefficients of, 90. 

— discriminant of coefficients of, 90. 

— expression for, 88. 

for an Einsteinian space-time, 392. 

" assigned Newtonian field, 388- 

392. 

single point mass, 419-423. 

— generalization of, 320. 

— indefinite, 141. 

— in mechanics, 293. 

invariance of, 294. 

•- invariance of, 308, 311. 

— qualitative properties of coefficients 
of, 369. 





ds² same for dynamics and light, 336, 

337. 

— statical, 326, 327, 371, 377, 378.392- 

Einstein tensor in, 380, 381, 

Riemann’s symbols in, 379. 

— ten functions involved in, 322. 

— vanishing, for light, 332, 338. 

variational principle for, 340. 

Divergence of double tensor, 154. 

gradient of vector, 154. 

stress tensor, 344. 

vector, 153. 

Dual variables, 68, 81. 

Eclipse observations, 407, 408. 
Einstein’s cylindrical space-time, 429. 

— form of Hamilton’s principle, 291. 

— gravitational equations, 376. 

— tensor, 200, 371. 

— . — divergence of, 371. 

for a V2, 37*- 

— — in statical ds‘^, 380, 381. 

— — linear invariant of, 371. 
Einsteinian, and Newtonian, trajec- 
tories, 395, 

— motion, law of time in, 396. 

— space-time, ds'^ for an, 392. 
Electrodynamics of bodies in motion, 

311, 312. 

Element of area, 99. 

Elongation, relative, 305. 

Empty space, 392. 

ds^ for, 382, 383. 

Energy and mass, 294, 298. 

— - -- matter, 298. 

' — metric of F4, 328. 

- density, 350, 351, 356, 382. 
field of uniform, 426. 

— flux of, 350, 351, 356, 357, 358. 
intrinsic, of matter, 297, 298. 

— kinetic and potential, 296, 324. 
tensor, 355, 358. 

- and curvature, 374. 

— electromagnetism, 374. 

— « equations of motion, 359. 

— local phenomena, 374. 

metric of space-time, 383. 

physical interpretation of, 359. 

vanishing, 382, 392. 

Envelope of family of planes, 101. 
Equation of orbit, 397. 

Equations, Einstein’s gravitational, 376. 

— of motion, Einsteinian modification 
of, 351. 
in terms of stress tensor, 351. 

of free particle, 287. 

— relativistic, for continuous system, 
359 - 

— total differential, 13-33. 

— integrals of, 47. 


Equipollence of vectors, 103. 
Equivalence, mechanical, a theorem of, 
394. 

ε-systems, 158. 

Ether, 335. 

Euclidean manifold, 121. 

and Riemann’s symbols, 242-246' 

— Christoffel’s symbols in, 121. 

— metric manifold, condition for, 242- 
246. 

Experiment and coefficients of ds², 363-368. 

Experiments, gravitational, and ds², 367. 

— optical, and ds², 363. 

Extension of a field, 160. 

curved space, 426, 427, 428, 

Facet, 201. 

Fermat’s principle, 335, 402. 

and geodesics, 341-343. 

in relativity, 340, 341. 

Focal directions in congruences, 283. 
Force absorbed in stress system, 349. 
374 . 375 - 

— disturbing, in planet’s motion, 397. 

— in relativity field, 328. 

— inversely as cube of distance, 397- 
Form, bilinear, 66, 

— invariant, 73. 

— linear, 67. 
multilinear, 66, 83. 

— quadratic, 66. 

Forms of class 1, 253. 

— conditions for, 257. 

Frame of reference, 333. 

Frequency of spectral line modified, 

400. 

Fresnel’s convection coefficient, 319. 

— formula for velocity of light in 
moving media, 318-320. 

Function, alternate, 35. 

implicit, 3. 

— uniform, 14. 

Functional matrix, characteristic of, 
9, 39, 87, 250. 

Functions of position, 80, 83. 

Galilean systems, 349. 

force, stress, and divergence in, 

349 - 

Gauss, on intrinsic geometry, 99. 
Gaussian curvature, 172. 

of Vz, 193. 

General relativity, concept of, 294. 

postulates of, 364. 

Generalization of metric of F*, 320. 

Lagrangian function, 322-324. 

Geodesic, co-ordinates locally, 164, 167, 
171. 

— curvature, 135-137. 
Geodesic, definition of, 103, 128. 

— deviation, 208-220. 

• Jacobi’s formula for, 219. 

— excess, 197. 

— manifold, 162. 

— ■ motion of particle in, 326. 

— principle, 337. 

— — Einstein's, 328, 331. 

in V4, 341* 

— sphere, as horizon, 434. 

— ■ surface, 164. 

— triangle, 197. 

Geodesics and trajectories, 324, 326, 
331- 

— autoparallelism of, 104, 140. 

— differential equations of, 131-135. 

— in rigid motions, 408. 

— in space of constant curvature, 428. 

— Lagrange’s equations for, 208, 331, 
332, 341, 367, 413. 

— near given geodesic, 208. 

— of zero length, 330-334, 337- 
Geometrical optics, 334. 

— - - according to Einstein, 335-338. 
Oiic (Einstein tensor), in statical ds-, 

380, 381. 

— — linear invariant G of, 380, 

381. 

G, linear invariant of Einstein tensor, 
380, 381. 

Gravitation, modification of Newton’s 
law of, 397. 

— not absorbed in energy tensor, 375. 

— with point mass, 419-423. 
Gravitational constant, 386. 

equations, and the facts, 387. 

Einstein’s, 376. 

' - — for spherical symmetry, 419. 

— - for statical 381. 

in space of constant curvature, 

428. 

— - modified by cosmological term, 
438. 

* rigorous solutions of, 437. 

solution of, 4 1 9-42 ”5. 

— experiments and 367. 

-- field and spectral lines, 400-402. 
path of light in, 403-408. 

— forces, as privileged, 374. 

— tensor, 371, 372. 

divergence of, 372. 

Hamilton’s principle, 287. 

■ Einstein’s form of, 291, 

modified, 294-298, 301, 322-324, 

351. 

Horizon, in De Sitter’s space-time, 434. 
Hyperspherical representation, 258. 
Hypersurface, 121. 

— hyperspherical representation of, 258. 


Hypersurfaces in Euclidean space, 249, 

253. 

— parallel, 251. 

Immersion of Vn in Euclidean space, 
121. 

Indefinite 141. 

Independence of functions, 5, 8-10. 
Inertia, index of, 299. 

— principle of, in relativity, 298. 

Inner multiplication of tensors, 79. 
Integral, general, 40, 42, 43, 45, 50- 

— independent, 40, 42. 

— of differential equations, 36, 37. 

— principal, 38, 39, 49. 
Intrinsic geometry of surface, 99. 
Invariance and Hamilton’s principle, 

291. 

- in relativity , 322. 

— of 308, 311. 

— ■ m-fold sys^tem, 69. 

' — simple system, 67, 

— transformation by, 62. 

Invariant, derivatives of, 83. 

— quadratic form, 73, 84. 

Isotropic manifolds, 232. 

Jacobi on geodesics, 208. 

Jacobian systems of equations, 52, 53. 
Jacobians, 2. See Determinant, functional. 

Kinematics, Galilean, 318. 

* of rigid systems, 301. 

modified, 303. 

— relativity, 311, 316, 

Kummer on congruences, 286. 

Lagrange and geodesics, 208, 331, 332, 
341, 367, 413. 

Lagrangian binomials, 289. 

— equations, 289, 331, 332, 341, 367, 
413- 

— parameters, 288. 

Laplace’s operator, 394. 

Law of gravitation, modifications of, 397. 
Light, constancy of velocity of, 335. 

— in gravitational field, frequency of, 
400-402. 

— path of, as trajectory, 403. 

— in gravitational field, 403-408. 

— ' propagation of, reversible, 365. 

- rays and trajectories, 343. 

— - signals, 364. 

Local time, 290, 311, 312. 

Lorentz transformation, 300, 308, 310, 
316. 

invariance for, 352, 353, 354- 

— — most general, 313. 

— translation, 316. 
Manifold, 1. 

— Euclidean, 121. 

— geodesic, 162. 

— metric, 119. 

— n-dimensional, 119. 

— sections of, 162. 

Manifolds, isotropic, 232. 

— of constant curvature, 236, 238, 240, 
246. 

their mutual applicability, 249. 

Mass and energy, 294, 298. 

— and metric of V4, 328. 
and velocity, 295, 

Matrices, functional, 8-12. 

Matter, mean cosmic density of, 439. 

— total quantity of, 427, 428. 

Maximum and minimum, 128. 
Maxwell’s theory, 383. 

Mayer’s method of integration, 25. 
Mechanical equivalence, a theorem of, 
394- 

Mechanics, classical, correction to, 291- 

294, 320, 392. 

— generalized, 320-324. 

— of continuous systems, 347, 352. 

- — in covariant equations, 348, 349. 

. - — with any co-ordinates, 347. 
Metric, angular, 123. 

— of space-time and energy tensor, 383. 

— of 1^4 and physical phenomena, 374. 
generalization of, 320. 

— - pseudo-Euclidean, 299, 360, 

Metrical elements of figure, 100, 
Metrics, different, covariant derivatives 

for, 222. 

-- — for same Vn, 220. 

Riemann's symbols for, 224. 

— in conformal representation, 229. 

— relativity, qualities of, 325. 

statical, 326. 

stationary, 326. 

— spatially uniform, 425. 

— with spherical symmetry, 408-414 
Michelson-Morley experiment, 335. 
Minimum time, principle of, 341, See 

Fermat's Principle. 

Mixed system, or tensor, 70, 71. 

— systems of total differential equations, 
29-'33- 

Molecular action, system with no, 360- 

363- 

Moments of co-ordinate lines, 98. 

direction, 92, 120. 

covariance of, 92, 120. 

— relation connecting, 92, 120 

Momentum, 295. 

Morera’s method of integration, 22-25. 
Motion, Einsteinian, of planets, 396. 
Multilinear form, 66, 69, 83. 
Multiplication of tensors, 76, 
Nebulae, density of, 439. 

Newtonian equations, 287, 377. 

— field, assigned, space-time for, 388- 
392. 

— motion, differences from Einsteinian, 

377. 

— potential, 375. 

— potential and ds², 336, 369. 

Normal congruence, 263, 275, 277, 285. 

— form of differential equations, 36, 

Operator A, properties of, 176. 

— linear, 33-37, 48. 84. 

Optics, geometrical, 334. 

Orbit, equation of, 397. 

Orthogonal directions, sets of, 205. 

Parallel, ambiental, 171. 

— displacement, 103. 

— — along a geodesic, 103, 104. 

angles unchanged by, 103, 114. 

-- — cyclic, 173, 186. 

— of vector, 192. 

Peres’s formula for, 193. 

Parallelism, 102. 

— and curvature, 193-198. 

— and infinitesimal displacement, 104. 

— angle of, 198. 

— differential definition of, 105. 

— equations of, 110-112. 

— extension of notion of, 137. 

- intrinsic character of, 106. 

“ intrinsic equations of, 107. 

invariance of, 110. 

— symbolic equation of, 107. 

■ with respect to surface, 102. 
Parallelogram rule for vectors, 117. 
Parallels, kinematical construction of, 

102, 104. 

Parameter of family of surfaces, 45. 

— first differential, 231, 418. 

— second differential, 154, 393, 418. 
Parameters and moments, relation of, 

92, 125. 

— Lagrangian, 288. 

— of co-ordinate lines, 98. 

— of direction, 91, 120. 

■ - ■ ' contra variance of, 91, 120. 

relation connecting, 91, 120- 

Parametric equations of surface, 86. 
Path of light, in gravitational field, 403, 

403-408. 

Pérès’s formula, 193. 

Perihelion, displacement of, 396, 398. 

-- formula for, 398. 

of Mercury, 399. 

of other planets, 400. 

Permutability (d5 — 5d), 116. 
Perturbations, Newtonian, 399. 

Pfaffian, 13, 20, 26, 161, 174. 

Pfaffian as invariant, 81, 82. 

- — systems, 14. 

Physical phenomena and metric of P'4, 
374* 

Planet, motion of, 369. 

Planets, Einsteinian motion of, 396. 

— motion of, discrepancies in, 322. 
Poisson’s equation and Einstein’s theory, 

386, 387. 

— equation for potential, 375, 377. 

— parentheses, 35, 36. 

Postulates of general relativity, 364. 
Potential, Newtonian, 287, 292, 297, 

322* 323. 369. 375, 377, 388, 394, 396, 
400, 403. 

and ds^, 336, 369. 

— — and metric of space, 391. 
Potentials, 10 gravitational, 375. 
Principe expedition, 407. 

Product of tensors, 76. 
Pseudo-Euclidean ds'^, 325, 376. 

- — every metric locally, 360. 

— metric, 299, 360, 383. 

and versors, 329. 

Quadratic, canonical form of, 205, 281. 

— differential form, invariant, 84. 

— form, 66. 

covariance of coefficients of, 73. 

definite, 90. 

— ■ — ds^f character of, 120. See 

— — invariant, 73. 

— — with non-vanishing discriminant, 
90. 

— forms of class zero, 242. 

of class i, 253. 

Euclidean, 242. 

— — Riemann’s symbols for, 242-246. 
theory of, 205. 

Radioactivity, 297. 

Radius of universe, 439. 

Reciprocal elements in determinants, 54, 
55, 80, 81. 

— tensors, 95. 

Refracting medium, space as, 402. 
Refraction of light, 334. 

Refractive index, 334, 

Relative motion, 313, 316. 

Relativity and Newtonian theory, differ- 
ences, 377. 

— composition of velocities in, 317. 

— general, and Poisson’s equation, 386, 

387. 

postulates of, 364. 

— invariance in, 322. 

— kinematics of, 311, 316. 

-- metrics, qualities of, 325. 

■ statical, 326. 

— — stationary, 326. 


Relativity, postulates of, 364. 

— principle of, 311. 

— restricted, 300. 

— special theory of, 300. 

Reversible motion, 327. 

— transformation, 3, 7, 61. 

Reversibility of light propagation, 365. 
Ricci’s coefficients of rotation, 268. 

— lemma, 148, 152. 

— symbols, 199, 372, 389, 411, 426. 

— tensor, 199. 

linear invariant of, 200, 380. 

Riemann-Christoffel tensor in V4, 372. 

30 components of, 372. See 
Riemann’s symbols. 

Riemannian curvature of Vn, 195-198. 
Riemann’s symbols, 172. 

and conformal representation, 228, 

246. 

and Euclidean metric manifold, 

242-246. 

-- Bianchi*s identities in, 182. 

-- — of first kind, 176, 179-182. 

— — of second kind, 175, 177, 178. 
Rigid motion in any manifold, 408. 
Romerian units, 307. 

Rotation, Ricci’s coefficients of, 268. 
Rotor of vector, 161. 

Saturation (of indices). See Contraction. 
Scalar product of vectors, 98, 126, 152. 
Schur’s theorem, 235. 

Schwarzschild’s solution of gravitational 
equations, 419-423. 

extensions of, 439. 

Second covariant derivatives, 184. 

— differential parameter, 154, 393. 

— fundamental form of Vn* 252. 

Section of manifold, 163, 

— of Va, 201. 

Sets of orthogonal directions, 205. 

— of simple systems, 74, 156. 

— reciprocal, 74. 

Severi’s theorem, 171. 

Shift, spectral, 400. 

Signals, light, and coefficients of ds^^ 
364-366. 

Simultaneity, 290, 311. 

Sirius, spectrum Companion of, 402. 
Sobral expedition, 407. 

Solution of differential equations, 36, 
48. 

— — gravitational equations, Schwarzs- 
child’s, 419—423. 

first approximation deduced 

from, 425. 

Solutions, rigorous, of gravitational 
equations, 437. 

Space, metric of, and Newtonian poten- 
tial, 391. 
Space, non-Euclidean, 391. 

— of constant curvature, extension of, 
426, 427, 428. 

— ' — gravitational equations in, 428. 
Space-time, 290. 

— an Einsteinian, for, 392. 

— co-ordinate transformations, 290. 

— De Sitter’s, 429-435. 

— De Sitter’s, constant negative curva- 
ture of, 436. 

— Einstein’s and De Sitter’s, case in- 
cluding, 435. 

— Einstein’s, curvature of space in, 438. 

— Einstein's cylindrical, 429. 

— metric, and energy tensor, 374. 

— with assigned Newtonian field, 392. 
Spatially uniform metrics, 425- 
Spectral displacement, 400. 

Sphere, geodesic, in Kg, 409, 410. 
Spherical symmetry and gravitational 
equations, 419. 

metrics with, 408-414. 

Statical ds", 326, 327, 37i, 377. 378, 392. 

— field, 400. 

— metrics, 326, 327. 

Stationary metrics, 326, 327. 

— — Fermat’s principle for, 340. 

Stress, 344. 

— and bilinear form, 345. 

— force absorbed in, 349, 

— in spatially symmetrical metrics, 425, 
429, 430, 43 (\ 437, 438- 

— kinetic, 351, 356, 358. 

— normal,. 345. 

— tensor and equations of motion, 
35 

divergence of, 344, 346, 354. 

in classical theory, 344. 

in generalized co-ordinates, 346. 

interpretation of divergence, 346. 

Sum of tensors, 76. 

Surface, geodesic, 164. 

— ' intrinsic geometry of, 99. 

— — parametric equations of, 86. 

- vectors, 96. 

Surfaces, developable, 100, 101. 
Symmetrical double systems, 72. 

— systems (tensors), 65. 

Symmetry, spherical, and gravitational 

equations, 419 

metrics with, 408-414. 

System, mixed, 70, 71. 

Systems (tensors), antisymmetrical, 73. 

— double, 65. 

— m-fold, 65. 

— of order w, 65. 

— of order zero, 65. 

— symmetrical, 65, 

Tensor, 70, 71. 
Tensor, Einstein’s, 200, 371. See Einstein’s 
tensor. 

— energy, 355- 

and equations of motion, 359. 

— first general definition of, 80. 

— gravitational, 371. 

10 components of, 372. 

— Riemannian, 371. See Riemann’s 
symbols. 

— second general definition of, 83. 

- stress, divergence of, 344, 346, 354. 
with vanishing elements, 71. 

Tensors, addition of, 75, 

— associated, 95, 96. 

— composition of, 79. 

— contraction of, 77—79. 

— inner multiplication of, 79. 

— multiplication of, 76. 

— reciprocal, 95. 

Third fundamental form of F,*, 259. 
Time, conventional, 364. 

— local, 290, 311, 312. 

Total differential, 13. 

equations, 13--33. 

equations, complete system of> 

15-18. 

Trajectories, 403. 

— and geodesics, 324, 326. 

— and light rays, 343. 

— - Einsteinian and Newtonian, 395. 

— in generalized mechanics, 324. 

— orthogonal, 263. 

Transformation, affine, 304, 305. 

— by contravariance, 67, 69-71, 

— by covariance, 64, 67-71. 

— by invariance, 62. 

— formulae of, 80. 

— homographic, 304 

— linear, of differentials, 80. 

— of derivatives, 85. 

Transformations, linear, 67. 

— Lorentz, 300, 308, 310, 316. 

— " reversible, 3, 7, 61, 

— space-time co-ordinate, 290. 
Translation, motion of, 305. 

Universe, radius of, 439. 

Variations, Poincaré’s equation of, 208. 
Variety. See Manifold. 

Vector, contravariant and covariant 
components of, 97. 

in 120, 127. 

— derivative of, 139, 140, 

— determination of, by invariants, 266. 

— product, 159. 

— product of versors, 201. 

— projection of, in F,*, 127* 

— transformation of, 62, 63, 64. 

Vectors, equipollence of, 103. 
Vectors, parallel and equal, 103. 

— scalar product of, 98, 126. 
— surface or tangential, 96. 

— zero, 97. 

Velocities, absolute and relative, 316. 

— composition of, 306. 

— — according to Einstein, 317. 

Velocity, earth’s orbital, 399. 

— large universal constant, 291, 292, 
311. 

— mass and, 295. 

— of light, 292, 306, 311, 334, 335, 339, 
382, 399. 

— — irreversible, 340. 

law of variation of, 339. 

— — non-symmetrical, 340. 


Versor (unit vector, direction), 92, 96, 
98, 102, 103, 123, 125, 126, 140. 

— in f^4, and corresponding vector, 
329^ 

Versors, and pseudo-EucUdean metrics, 
329. 

— spacelike, 330. 

— timelike, 330- 
Vibration of atom, 400. 

Volume of curved space, 427, 428* 

World lines, 290, 329, 352, 353. 

of light, 337, 364. 

parameters of, 353* 

and Stress tensor, 353. 

Zero vectors, 97. 








